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Abstract 

Nonequilibrium equalities have attracted considerable interest in the context of statistical 
mechanics and information thermodynamics. What is remarkable about nonequilibrium 
equalities is that they apply to rather general nonequilibrium situations beyond the linear 
response regime. However, nonequilibrium equalities are known to be inapplicable to some 
important situations. In this thesis, we introduce a concept of absolute irreversibility as a 
new class of irreversibility that encompasses the entire range of those irreversible situations 
to which the conventional nonequilibrium equalities are inapplicable. In mathematical 
terms, absolute irreversibility corresponds to the singular part of probability measure 
and can be separated from the ordinary irreversible part by Lebesgue’s decomposition 
theorem in measure theory. This theorem guarantees the uniqueness of the decomposition 
of probability measure into singular and nonsingular parts, which enables us to give a well- 
defined mathematical and physical meaning to absolute irreversibility. Consequently, we 
derive a new type of nonequilibrium equalities in the presence of absolute irreversibility. 
Inequalities derived from our nonequilibrium equalities give stronger restrictions on the 
entropy production during nonequilibrium processes than the conventional second-law like 
inequalities. Moreover, we present a new resolution of Gibbs’ paradox from the viewpoint 
of absolute irreversibility. This resolution applies to a classical mesoscopic regime, where 
two prevailing resolutions of Gibbs’ paradox break down. 
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Chapter 1 
Introduction 


In the mid-1990s, a significant breakthrough was achieved in the held of nonequilibrium 
statistical physics. Evans, Cohen and Morris numerically found a new symmetry of the 
probability distribution function of the entropy production rate in a steady-state shear- 
driven how [lj. This symmetry, later formulated in the form of huctuation theorems, 
was proven in chaotic systems by Gallavotti and Cohen [2], and later in various types of 
systems [33j6]. In this way, the huctuation theorems present a ubiquitous and universal 
structure residing in nonequilibrium systems. What is important about the huctuation 
theorems is that they apply to systems far from equilibrium. Moreover, they can be 
regarded as a generalized formulation of the linear response theory to rather general 
nonequilibrium situations [?]. In this respect, the huctuation theorems have attracted 
considerable attention. 

In the course of the development of the huctuation theorems, an important relation was 
found by Jarzynski M- The Jarzynski equality or the integral nonequilibrium equality 
relates the equilibrium free-energy difference between two configurations to an ensemble 
property of the work performed on the system during a rather general nonequilibrium pro¬ 
cess that starts from one of the two configurations. Thus, the Jarzynski equality enables 
us to estimate the free-energy difference by finite-time measurements in a nonequilibrium 
process outside of the linear response regime. Moreover, the Jarzynski equality concisely 
reproduces the second law of thermodynamics and the fluctuation-dissipation relation in 
the weak huctuation limit. In this way, the integral nonequilibrium equality is a funda¬ 
mental relation with experimental applications. 

Another significant relation is the Crooks huctuation theorem or the detailed nonequi¬ 
librium equality mm- The Crooks huctuation theorem compares the realization proba¬ 
bility of a trajectory in phase space under a given dynamics with that of the time-reversed 
trajectory under the time-reversed dynamics. The ratio of these two probabilities is ex¬ 
actly quantified by the exponentiated entropy production. Therefore, irreversibility of 
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a path under time reversal is quantitatively characterized by the entropy production of 
the path itself. Additionally, the Crooks fluctuation theorem succinctly reproduces the 
Jarzynski equality. 

Recently, the subject of nonequilibrium equalities marks a new development in the 
field of feedback control. The theory of feedback control dates back to Maxwell [12], In 
his textbook of thermodynamics, Maxwell pointed out that the second law can be violated 
if we have access to microscopic degrees of freedom of the system and illustrated the idea 
in his renowned gedankenexperiment later christened by Lord Kelvin as Maxwell’s de¬ 
mon. Maxwell’s demon is able to reduce the entropy of an isolated many-particle gas, by 
measuring the velocity of the particles and manipulating them based on the information 
of the measurement outcomes without expenditure of work. Maxwell’s demon triggered 
subsequent discussions on the relations between thermodynamics and information pro¬ 
cessing HM5]. Although Maxwell’s demon has been a purely imaginary object for about 
one and a half century, thanks to technological advances, the realization of Maxwell’s 
demon in real experiments is now within our hands mm- Hence, the theory of feed¬ 
back control has attracted considerable interest in these days, and quantitative relations 
between thermodynamic quantities and information were established [LSI !§] , forming a 
new field of information thermodynamics. One of the quantitative relation is known as 
the second law of information thermodynamics, which states that the entropy reduction 
in the feedback process is restricted by the amount of mutual information obtained in 
the measurement process. As the Jarzynski equality is a generalization of the second law 
of thermodynamics, the second law of information thermodynamics is generalized in the 
form of the integral nonequilibrium equality |201. Thus, nonequilibrium equalities are an 
actively developing field in the context of information thermodynamics. 

Despite of the wide applicability and interest of nonequilibrium equalities, they are 
known to be inapplicable to free expansion mm- Moreover, information-thermodynamic 
nonequilibrium equalities cannot apply to situations that involve such high-accuracy mea¬ 
surements as error-free measurements [23]. This inapplicability roots from divergence of 
the exponentiated entropy production, and can be circumvented at the level of the detailed 
nonequilibrium equalities [21]. However, to obtain the corresponding integral nonequilib¬ 
rium equalities, situation-specific modifications are needed, and moreover the form of the 
obtained equalities is rather unusual. There has been no unified strategy to derive these 
exceptional integral nonequilibrium equalities in the situations to which the conventional 
integral nonequilibrium equalities cannot apply. 

In this thesis, we propose a new concept of absolute irreversibility that constitutes 
those irreversible situations to which the conventional integral nonequilibrium equalities 
cannot apply. In physical terms, absolute irreversibility refers to those irreversible situa- 


5 


tions in which paths in the time-reversed dynamics do not have the corresponding paths 
in the original forward dynamics, which makes stark contrast to ordinary irreversible situ¬ 
ations, in which every time-reversed path has the corresponding original path. Therefore, 
in the context of the detailed nonequilibrium equalities, irreversibility is so strong that the 
entropy production diverges, whereas, in ordinary irreversible situations, irreversibility is 
quantitatively characterized by a finite entropy production. In mathematical terms, ab¬ 
solute irreversibility is characterized as the singular part of the probability measure in the 
time-reversed dynamics with respect to the probability measure in the original dynamics. 
Therefore, based on Lebesgue’s decomposition theorem in measure theory [251 26] , the 
absolutely irreversible part is uniquely separated from the ordinary irreversible part. As 
a result, we obtain nonequilibrium integral equalities that are applicable to absolutely 
irreversible situations [27]. Furthermore, our nonequilibrium equalities give tighter re¬ 
strictions on the entropy production than the conventional second-law like inequalities 

mm- 

As an illustrative application of our integral nonequilibrium equalities and the notion 
of absolute irreversibility, we consider the problem of gas mixing, which is what Gibbs’ 
paradox deals with [29]. Gibbs’ paradox is qualitatively resolved once we recognize the 
equivocal nature of the thermodynamic entropy [3TT1T32] . Although the standard quan¬ 
titative resolution of Gibbs’ paradox in many textbooks is based on quantum statistical 
mechanics, this resolution is indeed irrelevant to Gibbs’ paradox [31, [32] • Pauli gave 
a correct quantitative analysis of Gibbs’ paradox based on the extensivity of the ther¬ 
modynamic entropy [33, 32]. However, this resolution holds only in the thermodynamic 
limit, and ignores any sub-leading effects. Based on our nonequilibrium equalities in the 
presence of absolute irreversibility, we give a quantitative resolution of Gibbs’ paradox 
that is applicable even to a classical mesoscopic regime, where sub-leading effects play an 
important role. 

This thesis is organized as follows. In Chap. 2, we briefly review history of nonequi¬ 
librium equalities and a unified approach to derive them. In Chap. 3, we review a part 
of historical discussions on Maxwell’s demon and some fundamental relations under mea¬ 
surements and feedback control including the second law of information thermodynamics. 
Then, we review and derive information-thermodynamic nonequilibrium equalities. In 
Chaps. 4-6, we describe the main results of this study. In Chap. 4, we introduce a con¬ 
cept of absolute irreversibility in an example of free expansion, to which the conventional 
integral nonequilibrium equalities do not apply, and define absolute irreversibility in math¬ 
ematical terms. Then, in situations without measurements and feedback control, we derive 
nonequilibrium equalities in the presence of absolute irreversibility based on Lebesgue’s 
decomposition theorem, and verify them in several illustrative examples. In Chap. 5, we 
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generalize the results in Chap. 4 to obtain information-thermodynamic nonequilibrium 
equalities in the presence of absolute irreversibility and verify them analytically in a few 
simple examples. In Chap. 6, we briefly review discussions and conventional resolutions 
on Gibbs’ paradox and quantitatively resolve Gibbs’ paradox based on our nonequilibrium 
equalities with absolute irreversibility. In Chap. 7, we summarize this thesis and discuss 
some future prospects. 
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Chapter 2 

Review of Nonequilibrium Equalities 


In this chapter, we review nonequilibrium equalities in classical statistical mechanics. 
Nonequilibrium equalities are exact equalities applicable to quite general nonequilibrium 
systems, and are generalizations of the second law of thermodynamics and well-known 
relations in linear response theory. Moreover, nonequilibrium equalities give one solution 
of Loschmidt’s paradox. 

In the former half of this chapter, we review various nonequilibrium equalities in 
the chronological order. In the latter half, we review a unified method to derive the 
nonequilibrium equalities introduced in the former half. 


2.1 History 

First of all, we briefly review history of nonequilibrium equalities. 


2.1.1 Discovery of fluctuation theorem 

The field of nonequilibrium equalities was initiated by Evans, Cohen, and Morris in 1993 
p]. In a steady state of thermostatted shear-driven flow, they numerically discovered a 
novel symmetry in the probability distribution of the entropy production rate, which is 
nowadays called the steady-state fluctuation theorem. The theorem reads 


lim - In 

t —»oo t 


Pj g) 

P(- S) 


£, 


( 2 . 1 ) 


where t is the time interval and P(£) is the probability distribution function for the 
time-averaged entropy production rate £ in units of k^T, where k-Q is the Boltzmann 


constant and T is the absolute temperature of the system (see Fig. 2.1). They justified 
this symmetry by assuming a certain statistical ensemble of the nonequilibrium steady 
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Figure 2.1: (a) Probability distribution of the entropy production rate obtained by nu¬ 

merical simulations in shear-driven flow. The abscissa shows the negative of the entropy 
production rate £ in arbitrary units. We can observe that the probability of events vio¬ 
lating the second law does not vanish, (b) Demonstration of the steady-state fluctuation 
theorem The abscissa is the same as (a). The markers represent the logarithm 

of the ratio (log-ratio) of the probability of a positive entropy production to that of the 
reversed sign (namely the log-ratio in the left-hand side of Eq. (2.1)), and the dotted 
line represents the values of the log-ratio predicted by Eq. (2.1). We can confirm that 
the log-ratio is proportional to the negative of entropy production rate. Reproduced from 
Figs. 1 and 2 in Ref. [lj. Copyright 1993 by the American Physical Society. 


state. Subsequently, in the same system, a similar symmetry is disclosed in a transient 
situation from the equilibrium state to the steady state [33] • If is known as the transient 
fluctuation theorem and written as 


Pg) .a 

P(-E) 


( 2 . 2 ) 


This relation was proved under the same assumption as the steady-state fluctuation the¬ 


orem (2.1), which suggests that the fluctuation theorem is not a property particular to 
steady states, but applicable to wider classes of nonequilibrium situations. Equations 


(2.1) and (2.2) demonstrate that the probability of a positive entropy production is ex¬ 
ponentially greater than that of the reversed sign. Moreover, these fluctuation theorems 
reproduce the Green-Kubo relation [35] [36] and Onsager’s reciprocity relation [37], 38] in 
the limit of weak external fields [?]. Therefore, the fluctuation theorems can be regarded 
as extensions of the well-established relations in the linear-response regime to a more 
general nonequilibrium regime. 


These fluctuation theorems (2.1) and (2.2) resolve Loschmidt’s paradox in the follow¬ 
ing sense. Loschmidt’s paradox originates from his criticism to the //-theorem proposed 
by Boltzmann, which demonstrates that the Shannon entropy of the probability distri¬ 
bution function of phase space increases with time in a system obeying the Boltzmann 
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equation, although this equation is symmetric under time reversal. It was claimed that 
the //-theorem is a derivation of the second law of thermodynamics, and the irreversible 
macroscopic law can be derived from the reversible microscopic dynamics. However, 
Loschmidt criticized this argument by the observation that it should be impossible to 
deduce irreversible properties from the time-reversal symmetric dynamics. If we have a 
process with a positive entropy production and reverse the velocity of all the particle in 
the system at once, we can generate a process with a negative entropy production since 
the dynamics is time-reversal symmetric. Therefore, the entropy of the system should not 


always decrease, which is a defect of the //-theorem. Fluctuation theorems (2.1) and (2.2) 
demonstrate that, as Loschmidt pointed out, paths with a negative entropy production 
have nonzero probability. However, an important implication of the fluctuation theorems 
is that the probability of a negative entropy production is exponentially suppressed in 
large systems or in the long-time limit (namely when St 1). Therefore, paths violating 
the second law cannot be observed in macroscopic systems in practice. In this way, the 
fluctuation theorems reconcile Loschmidt’s paradox with the second law originating from 
the reversible dynamics. 

Soon after the discovery of the fluctuation theorems, the steady-state fluctuation the¬ 


orem (2.1) was proved under the chaotic hypothesis, which is an extension of the ergodic 
hypothesis, in dissipative reversible systems [3139]. Later, a proof free from the chaotic 
hypothesis was proposed in Langevin systems since the Langevin dynamics is ergodic in 
the sense that the system relaxes to the thermal equilibrium distribution in the long-time 
limit [3], and then generalized to general Markov processes [I] . Moreover, both the steady- 
state fluctuation theorem (2.1) and the transient fluctuation theorem (2.2) were shown in 
general thernrostatted systems in a unified manner [5j. It was pointed out that this proof 
of the fluctuation theorems is applicable even to Hamiltonian systems, although it had 
been believed that the thermostatting mechanism is needed for the fluctuation theorems 
[6]. Thus, the fluctuation theorems are known to apply to wide classes of nonequilibrium 
systems. 

The transient fluctuation theorem (2.2) was experimentally demonstrated in Ref. [JO]. 
They prepared a colloidal particle in an optical trap at rest, and then translated the trap 
relative to the surrounding water. In this transient situation, they obtained the probability 
distribution of entropy production, and observed trajectories of the particle violating the 


second law at the level of individual paths (see Fig. 2.2). The amount of this violation 
was confirmed to be consistent with Eq. (2.2). Later, the steady-state fluctuation theorem 


(2.1) was also verified in the same setup m- 
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Figure 2.2: (a) Histograms of time-averaged entropy production of a dragged colloidal 

particle in units of kf$T, where k-Q is the Boltzmann constant. Two different types of the 
bars represent two different measurement-time intervals. We can observe that the proba¬ 
bility of paths violating the second law does not vanish, (b) Log-ratio of the probability 
of entropy production to that of its negative. The experimental results are consistent 
with the transient fluctuation theorem (2.2). Reproduced from Figs. 1 and 4 in Ref. 
Copyright 2002 by the American Physical Society. 


2.1.2 Jarzynski equality 

In 1997, Jarzynski discovered a remarkable exact nonequilibrium equality in a Hamiltonian 
system |8]. Let H(X, x ) denote the Hamiltonian of the system, where A is an external 
parameter that we control to manipulate the system and x represents internal degrees of 
freedom of the system. The system is initially in equilibrium with the inverse temperature 
/3, and we subject the system to a nonequilibrium process by our manipulation of A from 
A; to Af. Let F( A) denote the equilibrium free energy of the system under a given external 
parameter A, i.e., 

e -HFm = f dx ( 2 . 3 ) 

The Jarzynski equality relates the free-energy difference A F := F( Af) — F( A,) to the 
probability distribution of work W performed during the nonequilibrium process as 

F)) = (2.4) 


where the angular brackets mean the statistical average under the initial equilibrium 
state and the given nonequilibrium protocol. Soon after the discovery, the same equality is 
proved in stochastic systems based on the master equation formalism |9j. It is noteworthy 
that we assume nothing about how fast we change the external parameter A, and therefore 


the Jarzynski equality (2.4) remains valid under a rapid change of the parameter, which 
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means that the Jarzynski equality applies to processes beyond the linear response regime. 


Moreover, the Jarzynski equality (2.4) is an extension of conventional thermodynamic 


relations to the case of a rather general nonequilibrium regime [8j. First, it leads to a 
second-law-like inequality in isothermal processes. Using Jensen’s inequality, we obtain 


,~/3(W-AF) 


) > e 


-/ 3(W-AF) 


(2.5) 


Combining this inequality with the Jarzynski equality (2.4), we conclude 


{W) > A F, 


( 2 , 6 ) 


which is the second law of thermodynamics in isothermal processes. The equality con¬ 
dition is that W has a single definite value, i.e., W does not fluctuate. We can regard 
W — A F as the total entropy production of the system. Let A U and Q denote the 
internal-energy difference and dissipated heat from the system to the heat bath, respec¬ 


tively. Then, the first law of thermodynamics is AU = W — Q. We rewrite Eq. (2.6) 


as 


AU- AF 
T 


Q 

^ > 0, 
T ~ 


(2.7) 


where T is the temperature of the heat bath (and therefore the initial temperature of 
the system). The first term represents the entropy production of the system AS because 
AF = AU — FAS', and the second term is the entropy production of the heat bath. 


Therefore, Eq. (2.6) means that the entropy production of the total system must be 
positive. 


Secondly, the Jarzynski equality (2.4) reproduces the fluctuation-dissipation relation 


in the linear response theory [8]. Let us denote a = /3(W — AF), and assume that the 
dissipated work W — AF is much smaller than the thermal energy k-&T, namely, |er| -C 1. 
Expanding ln(e -<7 ) up to the second order in a, we obtain 


ln(e-“) ~ —<«7> + -««x 2 ) - (<7> 2 ). 


( 2 . 8 ) 


Substituting the Jarzynski equality (2.4), we obtain 

1 


(W) — AF = 


2k B T 


(( w 2 ) - (wy 


(2.9) 


The left-hand side is dissipation of the total system, and the right-hand side represents 
fluctuations of the work during the process. This is one form of the fluctuation-dissipation 
relation. 
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Not only does the Jarzynski equality reproduce the second law of thermodynamics, 
but also it gives a stringent restriction on the probability of events violating the second 
law [123. Let a 0 be a positive number and let us calculate the probability that the entropy 
production is smaller than — <7 q: 


Probfcr < — <r 0 ] = 

< 


r--cro 


da P(o ) 


' — OO 
/‘-<T 0 


/ da P(a)e cr ° 
J —OO 

(e~ a )e~ a ° 


= e _fT °, 


( 2 . 10 ) 


where we use the Jarzynski equality (2.4) to obtain the last line. Therefore, the probability 


of negative entropy production is exponentially suppressed. While a negative entropy 
production (cr < 0) may occasionally occur, a greatly negative entropy production (|cr| 

1) is effectively prohibited by the Jarzynski equality. 


In addition to the above-described properties, the Jarzynski equality (2.4) enables us 


to determine the free-energy difference from our observation of how the system evolves 
under a nonequilibrium process because 


A F = -~\n(e-P w ), 


(2T1) 


although the free-energy difference is an equilibrium property of the system P03]. A 
naive method to determine the free-energy difference in experiments or numerical simula¬ 
tions is to conduct reversible measurements of work and use the fact W = A F. However, 
this method is not realistic in general because an extremely long time is needed to real¬ 
ize even approximately reversible processes. A more sophisticated method is to use the 


linear response relation (2.9) to determine the free energy difference through the work 


distribution obtained in the measurements. This method may still be time-consuming 
because the manipulation must be slow enough for the system to remain in the linear 


response regime. Equation (2.11) enables us to reduce the time of experiments or simu¬ 


lations when we calculate the free-energy difference, because Eq. (2.11) is valid even for 
rapid nonequilibrium processes. Q 

Hummer and Szabo found an experimentally useful variant of the Jarzynski equality 
m- The Hummer-Szabo equality can be utilized to rigorously reconstruct the free-energy 


Wire exact equality (2.11) requires more samples for convergence than the approximate equality (2.9) 
does [43]. Therefore, the total time required for convergence, which is the time of the process multiplied 


by the number of samples, can be longer when we use Eq. (2.11) than we use (2.9). 
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Figure 2.3: (a) Experimental setup of the stretching experiment. An RNA molecule is 

attached to two beads. One bead is in an optical trap to measure force and the other is 
linked to a piezoelectric actuator, (b) Difference between the estimated free energy and 
its actual value. The solid curves represent the reversible estimation (A F = (W)). The 
dotted curves represent the linear response estimation by Eq. (2.9). The dashed curves 
represent the estimation based on the Jarzynski equality, namely, Eq. (2.11). We see that 
the Jarzynski equality gives the best estimation. Reproduced from Figs. 1 and 3 in Ref. 
Copyright 2002 by the American Association for the Advancement of Science. 


landscape of a molecule from repeated measurements based, for example, on an atomic 
force microscope or an optical tweezer. Shortly thereafter, in a setup with an optical 


tweezer shown in Fig. 2.3 (a), the free-energy profile of a single molecule of RNA was 
reconstructed by mechanically stretching the RNA in an irreversible manner 03 (see Fig. 


2.3 (b)). This experiment demonstrated that the Jarzynski equality is useful in practice to 


determine free-energy differences of systems. In a similar manner, the free-energy profile 
of a protein was estimated by stretching the protein by an atomic force microscope 


2.1.3 Crooks fluctuation theorem 


In 1998, Crooks offered a new proof of the Jarzynski equality in stochastic systems 03- 
What is remarkable about this proof is the method comparing an original process with 
the time-reversed process to derive nonequilibrium relations. One year later, this idea led 
him to propose a novel relation now known as the Crooks fluctuation theorem (10J, which 
reads 


m =e . 


( 2 . 12 ) 
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where P(cr) is the probability distribution function of entropy production er, and Pt(cr) 
is that in the time-reversed process. Later, this theorem was generalized to Hamiltonian 
systems with multiple heat baths by Jarzynski 


The Crooks fluctuation theorem (2.12) can be regarded as a generalized version of 


the steady-state fluctuation theorem (2.1) in systems symmetric with respect to reversal 


of the perturbation that drives the steady flow; in this case, we have Pt(— a) = P(—cr), 


which reduces Eq. (2.12) to Eq. (2.1), because there is no difference between the original 


process and the time-reversed one. What is more, the Crooks fluctuation theorem easily 
reproduces the Jarzynski equality as follows: 


<o = 


' —oo 
/*oo 


da e <T P(a) 
da P\a) 


= 1 , 


(2.13) 


where we have used the Crooks fluctuation theorem (2.12) to obtain the second line, and 


the normalization of probability to obtain the last line. Moreover, the Crooks fluctuation 
theorem implies that 


a = 0 ^P(a) = P\-a). 


(2.14) 


Soon after, Crooks found a significant generalization of his own theorem HU He 
related irreversibility of an individual path T in phase space to its own entropy production 
cr[T], ]^] that is, 


m 

pt[rt] 


= e CT[r] , 


(2.15) 


where W represents the time-reversed path of T; V is the path-probability functional 
under a given dynamics, and V t is that under the time-reversed dynamics. To reproduce 
Eq. (2.12) from Eq. (2.15), we note 


a 


P(a) = I VTV[T]5(a[T] - a) 

e a[r] 5(a[T} 

= e a J VT^p]6(ap} + a 
= e°P'(-a), 



(2.16) 


We use box brackets to indicate that f[T] is a functional of T instead of a function. 
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where VY denotes the natural measure on the set of all paths, and we use Eq. (2.15) 


to obtain the second line, and we assume entropy production is odd under time reversal, 
namely cr [r] = — <r[Tt], to obtain the third line. A more general form of the nonequilibrium 


integral equality can be derived based on Eq. (2.15). Let WfT] be an arbitrary functional. 
Then, we obtain 


(Te~ a ) = J i>rp[r]e-''n.F[r] 



= <F)\ 


(2.17) 


where (• • • )t denotes the average over the time-reversed probability ^[T']. When we set 


T to unity, Eq. (2.17) reduces to Eq. (2.13). The idea to consider the path-probability 


of an individual path is a crucial element to treat nonequilibrium equalities in a unified 
manner as described in Sec. 2.2. 

The Crooks fluctuation theorem (2.12) was experimentally verified [HHj in a similar 


setup in Ref. 051. which was used to verify the Jarzynski equality. The work extracted 
when an RNA is unfolded and refolded was measured, and the work distribution of the 
unfolding process and that of the refolding process, which is the time reversed process of 


unfolding, were obtained (see Fig. 2.4 (a)). It was verified that the two distributions are 


consistent with the Crooks fluctuation theorem (2.12) (see Fig. 2.4 (b)). Moreover, the 


free-energy difference of a folded RNA and an unfolded one was obtained using Eq. (2.14), 
and the obtained value is consistent with a numerically obtained one. 


2.1.4 Further equalities 

In this section, we will briefly review further nonequilibrium equalities. To this end, let 
us introduce some kinds of entropy production. Mathematical definitions of these kinds 
of entropy production are presented in Sec. 2.2. 

The total entropy production As to t is the sum of the Shannon entropy production of 
the system As and the entropy production of the heat bath Asbath, that is, 

As to t = As + ASbath- (2.18) 

The entropy production of the bath is related to the heat q dissipated from the system to 
the bath 


Asbath = q/T, 


(2.19) 
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Figure 2.4: (a) Work distribution of the unfolding process (solid lines) and the refolding 

process (dashed lines). Three different colors correspond to three different speed of un¬ 
folding and refolding. We find that the work value where two distributions at the same 
speed cross each other is independent of the speed (as shown by the vertical dotted line). 
Equation (2.14) implies that this value corresponds to zero entropy production, namely, 
the work value is equal to the free-energy difference, (b) Verification of the Crooks fluctu¬ 
ation theorem (2.12). In the setup of this experiment, the entropy production a reduces 
to work W minus the free-energy difference. It is illustrated that the log-ratio of the 
probability distributions of work is proportional to work itself. Reproduced from Fig. 2 
and the inset of Fig. 3 in Ref. [49]. Copyright 2005 by the Nature Publishing Group. 


where T is the absolute temperature of the bath. In a steady-state situation, the heat q 
is split into two parts as 


q = qhk + <?ex, (2.20) 

where q kk is called the housekeeping heat, which is the inevitable heat dissipation to 
maintain the corresponding nonequilibrium steady state, and q ex is called the excess heat, 
which arises due to a uon-adiabatic change of the external control parameter. Along with 
these definitions of heat, we define two kinds of entropy production as 

As hk = q hk /T, As ex = q ex /T. (2.21) 


Hatano-Sasa relation 

The Hatano-Sasa relation is a generalization of the Jarzynski equality [50]. The Jarzynski 
equality relates the free-energy difference of two equilibrium states to the nonequilibrium 
average starting from one of the equilibrium states. In a similar manner, the Hatano-Sasa 
relation relates the difference of nonequilibrium potentials 0 of two nonequilibrium steady 
states, which is a generalization of U — F in equilibrium situations, to an average starting 
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from one of the steady state. The Hatano-Sasa relation reads 


/g-A^-Asex/feB\ _ ^ 


( 2 . 22 ) 


From Jensen’s inequality, we obtain 

(A</>) + > o. 


(2.23) 


This inequality can be interpreted as a nonequilibrium version of the Clausius inequality 


(As > — q/T ). In fact, the inequality (|2.23[) can be rewritten as 


&b(a 4 >) > 


(gex) 

T 


(2.24) 


The equality can be achieved in quasi-static transitions between the two nonequilibrium 
steady states, which is also analogous to the Clausius inequality, whose equality is also 
achieved in quasi-static transitions. 


Seifert relation 

The Seifert relation applies to an arbitrary nonequilibrium process starting from an arbi¬ 
trary initial state |51j . The integral Seifert relation is given by 

( e -A Stot /fc B } = L (2.25) 


The corresponding inequality is 


(Astot) > 0, 


(2.26) 


which can be considered as a second law in a nonequilibrium process. The detailed version, 
which is analogous to the Crooks fluctuation theorem, is given by 


P{ As tot ) _ -Astot/fcs 
-P(Astot) 


(2.27) 


It is noteworthy that these relations hold for an arbitrary time interval. Equation (2.27) 


can be regarded as a refinement of the steady state fluctuation theorem (2.1). The steady 


state fluctuation theorem (2.1) holds only in the long-time limit. This is because £ in 


Eq. (2.1) is in fact the entropy production rate of the bath and does not include that of 
the system. Therefore, in Eq. (2.1), the time interval should be long enough to ignore the 
entropy production of the system compared with that of the bath. 
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Relation for housekeeping entropy production 

In Ref. [52], a relation for housekeeping entropy production is also obtained as 

(e' Ashk/fcB ) = 1, (2.28) 


which leads to 


(Ashk) > 0. 


(2.29) 


2.2 Unified formulation based on reference probabil¬ 
ities 


In this section, we review a unified strategy to derive the nonequilibrium equalities intro¬ 
duced in the previous section. 

In Ref. m, Crooks revealed that a unified approach to derive nonequilibrium equal¬ 
ities is to compare the nonequilibrium process with the time-reversed process and obtain 
a detailed fluctuation theorem, namely, the Crooks fluctuation theorem given by 


'UU = ~P(W[T]-AF) 

V[T] 


(2.30) 


where quantities with a superscript f are the ones in the time-reversed process. Later, 


Hatano and Sasa derived the nonequilibrium equality (2.22) in steady states by comparing 
the original dynamics with its “time-reversed dual” dynamics in a sense to be specified 
later. In their derivation, they essentially used a detailed fluctuation theorem given by 


U P_1 = -A0[r]-Asex[r]/fc B 
V[T] 


(2.31) 


where quantities accompanied by f are, in this equality, the ones in the time-reversed dual 


Table 2.1: Choices of the reference dynamics and the associated entropy productions. 
Specific choices of the reference dynamics lead to specific entropy productions with specific 
physical meanings. The meaning of the “boundary term” is explained later. 


Reference dynamics 

Entropy production a 

time reversal 
steady-flow reversal 
time reversal + steady-flow reversal 

Asbath/^B + (boundary term) 
Ashk/^B + (boundary term) 
Ase X //cB + (boundary term) 
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process. Their work indicates that a further unification is possible, namely, a detailed 
fluctuation theorem will be obtained when we compare the original dynamics with a 
properly chosen reference dynamics. With this speculation, let us generalize the detailed 


fluctuation theorems (2.30) and (2.31) to 


V r [ T] 

V[T] 


= e~ alr l 


(2.32) 


where V T represents a reference probability of a reference path, and <t[T] is a formal 
entropy production. In the case of the Crooks fluctuation theorem ( 2.12[ ), the reference 
is the time reversal, and cr[T] reduces to W [T] — A F. In the case considered by Hatano 
and Sasa, in a similar way, the reference is the time-reversed dual, and cr[T] reduces to 


A0[T] + As ex [r]/fc B . Once we obtain the detailed fluctuation theorem (2.32), we succinctly 
derive an integral nonequilibrium equality given by 


(O = i, 


(2.33) 


because 


(e~ a ) = I VTe- a V[T] 

= I VYV T [T] 

= 1 , 


(2.34) 


where we use Eq. (2.32) to obtain the second line, and we use the normalization condition 


for the reference probability to obtain the last line. In the same way, we obtain 


= (jy, 


(2.35) 


where J-"[r] is an arbitrary functional, and (• • • ) r represents the average over the reference 
probability V r \Y\. 


In the rest of this section, we validate Eq. (2.32) in specific systems. To be precise, 


we show that appropriate choices of the reference probability make the formal entropy 
production a reduce to physically meaningful entropy productions. The results are sum¬ 
marized in Table 12.11 


2.2.1 Langevin system 

First of all, we consider a one-dimensional overdamped Langevin system. One reason 
why we deal with the Langevin system as a paradigm is that steady states can be simply 
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achieved by applying an external driving force. Moreover, the Langevin system is ther¬ 
modynamically sound in that it relaxes to the thermal equilibrium state, i.e., the Gibbs 
state after a sufficiently long time without the external driving force. 

This part is mainly based on a review article by Seifert, i.e., Ref. [53]. 

Basic properties and definition of heat 

Let x(t) denote the position of a particle at time t in a thermal environment with tem¬ 
perature T. We consider a nonequilibrium process from time t = 0 to r controlled by an 
external parameter A (t). The overdamped Langevin equation is given by 

x(t) = fiF(x(t), A(t)) + ((t), (2.36) 

where /j is the mobility, and F(x, A) is a systematic force applied to the particle with the 
position x when the external control parameter is A, and ((t) represents a random force. 
We assume that ((t) is a white Gaussian noise satisfying 

(at)C(t'))=2D8(t~t'), (2.37) 

where D is the diffusion constant. The systematic force F(x, A) consists of two parts, that 
is, 


F (x, A) = -d x V(x, A) + f{x, A). (2.38) 

The first part is due to the conservative potential V(x, A) and /(x, A) is the external driv¬ 
ing force. Under this Langevin dynamics, the probability to generate an entire trajectory 
{x} starting from x 0 = x(0) under a given entire protocol {A} is calculated as 

P[{x} |x 0 , {A}] = tfe ~ A[{x} ’ W] , (2.39) 


where the action M[{x}, {A}] of the trajectory {x} is 


M[{x},{A}] 


r ro * _ ^ f ( x ’ x )) 2 , 9 x f ( x , a ) 

Jo l 4 D +/i 2 


(2.40) 


(see Appendix A for a derivation). Another strategy to describe the system is to trace 
the probability p(x, t ) to find the particle at x at time t. We can show p(x, t) to obey the 
Fokker-Planck equation given by 


d t p(x,t) = -d x j(x,t), 


(2.41) 
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where the probability current j(x,t) is defined as 


j(x, t ) = A (t))p(x, t) - Dd x p(x, t ) (2.42) 

(see Appendix A for a derivation). With this probability, the entropy of the system is 
defined as the stochastic Shannon entropy of the system, i.e., 

s(t) = — k B lnp(x(£), t). (2.43) 


The mean local velocity is defined as 


v(x,t ) 


p(x,ty 


(2.44) 


When the external driving force is not applied, i.e., f(x, A) = 0, the system relaxes to 
the thermal equilibrium state given by 


Pe q (x, A) 




(2.45) 


where (3 is the inverse temperature and free energy F( A) is defined as 

e -P F W = I dxe~ pv{x ' x \ (2.46) 

because the kinetic energy contributes nothing due to the assumption of overdamping. 

Next, we consider how we should define heat in this Langevin system m- The dissi¬ 
pated heat is the energy flow from the system to the bath. This energy transfer is done by 
the viscous friction force —jx and the thermal noise 7 ^(t), where 7 = 1/fi is the friction 
coefficient. Therefore, the “work” done by these force should be identified with the heat 
flowing into the system. Thus we define the dissipated heat as 


<ll{ x }} = ~ dtxi-'yx + 7 C). 
Jo 


(2.47) 


Using the Langevin equation (2.36), we obtain 


?[{ x }] = / dtxF(x, A). 

Jo 

Now we can define the entropy production of the heat bath as 

a q 

^A-Sbath — 7 ^- 


(2.48) 


(2.49) 
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In this way, the concepts of the heat and entropy are generalized to the level of an 
individual stochastic trajectory. 


Steady-state properties and definitions of thermodynamic quantities 


When the external driving force f(x, A) is applied at a fixed A, the system relaxes to a 
nonequilibrium steady state p ss (x,X). In the analogy of the equilibrium state (2.45), let 
us define a nonequilibrium potential cj)(x, A) by 


p BB (x,\) = e-X*' x >. 


(2.50) 


The steady current is defined as 

j ss (x, A) = fiF(x, A )p ss (x, A) - Dd x p ss (x, A), (2.51) 


and the mean velocity in the steady state is given by 


v ss (x, A) 


3bs(x, a) 

Pbb(x, A) 

fiF(x, A) - Dd x In p ss (x, A) 
pF(x, A) + Dd x (p(x, A). 


(2.52) 


Using this expression, we rewrite Eq. (2.48) as 


g[{a:}] = — / dtxv ss (x, A) — k B T / dtxd x cj)(x, A), 
I 1 Jo Jo 


(2.53) 


where the Einstein relation D = fik B T is assumed. The first term is the inevitable 
dissipation proportional to the steady mean velocity v ss and is therefore identified as the 
housekeeping heat, namely, 


<?hk[{a;}] 


1 

h 


dtxv ss (x, A), 


(2.54) 


and the second term is the excess contribution after the subtraction of the housekeeping 
part from the total heat, and defined as 


q ex [{x}} =-k B T / dtxd x (j)(x, A). 
Jo 


(2.55) 
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Therefore, following Ref. [50], we define two kinds of entropy production as 


and 


As hk [{a:}] = ^ 


dtxv ss (x, A), 


(2.56) 


A S ex [{A }] 


— k-Q / dtxd x (f>(x , A) 
Jo 


k-Q 


—A0 + f dt\d\(j)(x, A) , 
Jo 


(2.57) 


because dcj) = ( d x (j))dx + (d\(ft)d\. Note that the ensemble average of the excess entropy 
production vanishes in steady states because A is independent of time, which justifies that 
the excess entropy production is indeed an excess part due to non-adiabatic changes of 
the control parameter A. 


Time reversal and Asbath 

We consider a process starting from an initial probability distribution po(^o) under a pro¬ 
tocol {A}, and the time-reversed process starting from an initial probability distribution 
Po^J) under the time-reversed protocol {A^} defined by A^(f) := A (t — t). ffere, we do 
not assume any relations between po(xo) and Po(a4). Now we compare the realization 
probability of the original path {x} and that of the time-reversed path {ad} defined by 
x\t) = x(t — t). The original probability is 


^[MI{A}] = P[{a:}|a;o,{A}]po(a:o) 
= J\fp 0 (x 0 )e~ A ^ x k{ x }\ 


and the time-reversed probability is 




p(U t }l4.{ At }]Po(4) 

A/4(4)e-^’>.OTI 


Therefore, the formal entropy production defined in Eq. (2.32) reduces to 


<r[M] 


V[{x'}\{ A f }] 

Hi*} |{A}] 

M[{x'},{A'}]-4{x},{A}])-lnAA^ 


(2.58) 


(2.59) 


(2.60) 
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The first term is called the bulk term, and the second term is called the boundary term, 
because the second one arises from the boundary conditions po and p\. Using Eq. (2.40), 
we obtain 




dt 


dt 


dt 


(id(t) -/xF(x t (t),A t (t))) 2 d x F(x\t), A f (i)) 
- +V -o- 


AD 

(-x(r -t) - pF(x(r -t),A(r - t))) 2 d x F(x(r - t), A(r - t)) 

-4 D - + "-2- 

(x(t) + /iF(x(t),X(t))) 2 d x F(x(t),\(t)) 


AD 


+ 


(2.61) 


where we change the integration variable from t to r — t to obtain the last line. Therefore, 
the bulk term is 




kuT 


B 1 Jo 


dtxF(x, A) = 


q[{x}} 

kuT ’ 


(2.62) 


where we use the definition (2.48) to obtain the last equality. By Eq. (2.49), we obtain 

4 ( 4 ) 


r r 11 ^ s bath , . 

ff|W1 = — -“UK)' 

Here, we assume that the initial state is in equilibrium 


(2.63) 


PoM =P„(i 0,A„) = 


(2.64) 


Although the initial state of the reversed dynamics can be set to an arbitrary probability 
distribution, we set it also to the canonical ensemble as 


4(4)=p eq (4,Aj) = eHWU)-^)) 


= e -/3(U(x T ,\ T )-F(\ T ))' 


(2.65) 


In this case, we obtain 


4(4] = 


As 


bath 


h 


+ (3(AU — AF) 


= P{q + AU - AF) 
= 3{\Y -AF), 


( 2 . 66 ) 


which means that we obtain the Crooks fluctuation theorem (2.12). Therefore, we also 


obtain the Jarzynski equality (2.4) succinctly. 


Next, we assume nothing about po, and set the initial probability of the reversed 
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dynamics to the final probability of the original dynamics, namely, 


4 ( 4 ) = p( x l , T ) =p(x t ,t). 


(2.67) 


In this case, the boundary term reduces to the Shannon entropy production of the system 
as 


a[{x}] 


Therefore, we obtain 


As 


bath 


fa 


In p(x T , t ) + lnp(x 0 , 0) 


ASfaath + s ( t ) - s(0) 
k B 

ASbath + As 
k B 


a[{x}] 


Astot 

k B 


( 2 . 68 ) 


(2.69) 


and the detailed Seifert relation (2.27), which automatically 


relation (2.25). 

Finally, we assume nothing about p 0j and set the initial 
dynamics to p 0 . Then, we obtain 


derives the integral Seifert 
probability of the reversed 


4M] = 


(2.70) 


where 


= fi'' - \np a (x T ) + lnpo(^o) 


(2.71) 


is the dissipation functional defined in Ref. p)j and used in the transient fluctuation the¬ 
orem (2.2). 


Dual (steady-flow reversal) and Ashk 

Next, we consider the steady-flow-reversed dynamics. To this aim, we define the dual of 
the external force F(x, A) as 

F\x, A) = F(x, A) - 2t;ss ’ f(3;,A ) , (2.72) 

h 

where we use v SSt F to represent the steady velocity under force F. Let us demonstrate 
that the steady state under force F is also the steady state under force Ffi Comparing 
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Eq. (2.52), we obtain 


pF^(x, A) = —/lF(x , A) — 2 Dd x <j>p{x, A), 


(2.73) 


or 


fiF^x, A) + Dd x (j)p{x 1 A) = —(fiF(x, A) + Dd x cf)F(x, A)). 


(2.74) 


Since p SSt F(x, A) = e is the steady solution of the Fokker-Planck equation (2.41), 

we have 


0 d x j SS F^x , A) 


d x {pF{x, A )p ss , F (x, A) - Dd x p SS! p(x, A)) 
<9 x [(/uF(x, A) + Dd x <j> F (x, \))p ss , F (x, A)]. 


Using Eq. (2.74), we obtain 


0 = d x [(nF^(x, A) + Dd x (j) F {x, X))Pss,f(x, A)] 
= ^(/nF^x, A )pss,f(x, A) - Dd x p SSyF (x, A)), 


which means p SS) i? is also the steady state solution under force F\ that is, 


p ss>F (x ,\) = Pss,fi(x,\) p ss (x, X)), 
<j> F (x, A) = 0Ft(x, A) (=: 0(x, A)). 


Comparing Eqs. (2.51) and (2.74), we have 


j ss , F (x, A) = -j ss ,Fi(x, A) (=: j ss (x, A)). 


(2.75) 


(2.76) 


(2.77) 

(2.78) 


(2.79) 


Therefore, the dual dynamics has the same steady state as the original dynamics and the 
negative of the steady current in the original dynamics. 

Now, we consider a process starting from an initial probability distribution Po(xq) 
and the dual process starting from an initial probability distribution p\{x o). The original 
probability is 


P f [{x}|{A}] = V F [{x}\x 0 , {\}]po(x 0 ) 

= Afp 0 (xo)e~ AF[{x} ’ {x}] (2.80) 
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and the dual probability is 


*Vt[{*}|{A}] = ^Ft[{a:}ko,{A}]po(^o) 

= Mpl{x 0 ) e -' 4 Ft[W.{A}]_ ( 2 . 81 ) 

Therefore, the formal entropy production reduces to 

*[{*}] = (-MM, {A}] - -4 F [{*}, {A}]) - In . (2.82) 

The bulk term can be calculated as 


Apt [{x}, {A}] - A F [{x}, {A}] 

3 . d x F 
- + h^r- 


= dt 


dt 

(x — pi 

4 D 

dt 



dt 


(x - pFf , d x F 


4 D 


+ p- 


h 

2 ( 


(2.83) 


Using Eqs. (2.72) and (2.73), we obtain 


AlFt[{x},{A}] -„4 f [{x},{A}] = 


dt 


— (i + Dd x (j))v ss - d x v s 


= — / dtxv ss - 

D Jo 

= — / dtxv s8 , 


dte^d x j S: 


(2.84) 


where we use j ss = v ss e~^ to obtain the second line and the fact that the divergence of 
the current vanishes in the steady state to obtain the last line. By the definition (2.56), 
we obtain 


When we set p' Q to po, the formal entropy production reduces to 

"[{*}] = ( 2 . 86 ) 

Kb 

and therefore we obtain the integral fluctuation theorem for the housekeeping entropy 


production (2.28). 






















Time-reversed dual and As ex 

We consider a process starting from an initial probability distribution Po(x 0 ) under a 
protocol {A}, and compare it with the dual process starting from an initial probability 
distribution p' 0 (x q) under the time-reversed protocol {At}. The original probability is 




'Pf[{x}\xqi {A}]p 0 (x 0 ) 

Np 0 (x 0 )e- AF[{X} ’ {X}] 


and the time-reversed dual probability is 


^t[{xt}|{At}] 


p Ft [{x t }i4,{A t }K(4) 

W4(4)e-^ t[{- + }TA+}] 


Therefore, we obtain 

*[{*}] = (-V]^}, {At}] - A f {{x}. {A}]) - In 
The bulk term is calculated as 


(2.87) 


( 2 . 88 ) 


(2.89) 


^t[{x t },{A t }]-^[{x},{A}] 

(-x-^Ft ) 2 dxF t 

AD ^ 2 


= / dt 
Jo 

= [ dt 
Jo 


- / dt 
Jo 


(x - pF) 2 d x F 
AD ^ 2 


~(F + F*)(-2x + p(F - Ft)) + ^ x (Ft - F) 


(2.90) 


Using Eqs. (2.72) and (2.73), we obtain 


^t[{xt}, {At}] - A f [{x}, {A}] = dt [{d x <j)){-x + v ss ) - d x v ss ] 


= — dtxd x (f) — / dte^dxjs 

Jo Jo 

— — I dtxd x (j). 


(2.91) 


By the definition (2.57), we obtain 


(j[{x}] = 


As P 


In 


4 ( 4 ) 

Po(xo)' 


(2.92) 
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We assume that the initial state is the nonequilibrium steady state given by 


p 0 (x 0 ) = e ~^ xoM) (2.93) 

and set the initial state of the time-reversed dual dynamics to the nonequilibrium steady 
state as 



— 0 


(2.94) 


Then, we obtain 


a[{x}] 


A.Sex 

k B 

A Sex 

k B 


+ (j)(x T , X T ) - <j>(x Q , A 0 ) 
+ A 4>. 


Therefore, we reproduce the Hatano-Sasa relation (2.22) in a simple manner. 


(2.95) 


2.2.2 Hamiltonian system 

Next, we consider a Hamiltonian system consisting of a system and a heat bath with 
inverse temperature /3. We consider only time reversal as the reference dynamics because 
it is difficult to define steady flow in a Hamiltonian system in general. 

This part is partly based on Ref. [55J. 

Setup 

Let 0 denote the position in the phase space of the total system. We separate the degrees 
of freedom z into two parts as z = (x,y), where x denotes the degrees of freedom of the 
system, and y is the degrees of freedom of the bath. We assume that the Hamiltonian of 
the total system can be decomposed into 


A) = H(x, A) + H int (x, y, A) + H hath (y), (2.96) 

where A is an external control parameter. We also assume that the Hamiltonian is invari¬ 
ant under time reversal. We vary A from time t — 0 to r, and the system is subject to a 
nonequilibrium process. The free energy of the bath Fbath is defined by 

0 4q,ath 


= / dye 




(2.97) 
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and is invariant during the process. Moreover, we define an effective Hamiltonian of the 
system H e f(x, A) by tracing out the degrees of freedom of the bath as 

e ~/3H ei (x,\) e ~/3F hath 

The free energy of the total system F tot ( A) defined by 

e -/3Ftot(A) 

and the free energy F( A) based on H e f(x, A) defined by 

e ->n^j dxe -^ (2 , 00) 

are related by 


= / dze~ pH '°^ 


(2.99) 


= e -pH{x, A) 


dye 


-/3(tf int (x,{/)+ffbath(i/)) 


(2.98) 


-Ptot(A) — F( A) + Tbath- 


( 2 . 101 ) 


Time reversal 


We compare the original process { 2 } starting from an initial probability distribution of the 
total system po(^o) with the time-reversed process { z t} starting from an initial probability 
distribution of the total system Pq(zq), where z\t) = z*(t — t ) and the superscript * 
represents the sign reversal of momenta. We note that the probability to realize a path 
{z} is the same as the probability to have z o in the initial state because the Hamiltonian 
system is deterministic as a whole. Therefore, we obtain 


4{4] 


, 4(4} |{At}] 
v K4I(A}] 

i 4(4l 

p 0 (z 0 y 


( 2 . 102 ) 


Now, we assume that the initial state is the equilibrium state of the total Hamiltonian 


p 0 fzo) = 


(2.103) 


Moreover, we set the initial state of the reversed process to the equilibrium state of the 
total Hamiltonian 


p t( z t) — 


(2.104) 
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Thus, we obtain 


a[{;r}] = f3(AH tot - AF tot ). (2.105) 

Since the total system is isolated, the difference of the total Hamiltonian is due to the 
work done by the external controller, that is, 

AH tot = W. (2.106) 


Therefore, noting that Fbath is independent of A, we obtain 


a — j3(W - AF), 


(2.107) 


which gives the Crooks fluctuation theorem (2.12) and the Jarzynski equality (2.4) 


Next, we assume that the initial state of the system is not correlated with the initial 
state of the bath and that the initial state of the bath is the canonical ensemble, namely, 


Po(zq) = po(x 0 )e AH bs .th{yo) F hath ). 


(2.108) 


In addition, we set the initial state of the time-reversed dynamics to the product state of 
the final probability distribution of the system in the original process and the canonical 
ensemble of the bath 



P (Xq)c /^(^hjath(yo) H>ath) 
P \ g /3(^bath(yr)~ ^bath) 


(2.109) 


Therefore, we obtain 


a[{x}] = /3A if ba th - lnp T (x T ) + lnp 0 (^o)- (2.110) 


We define an unaveraged Shannon entropy of the system by s(t) = — hip t (x t ), and we 
have 

Ac 

& — /dAi/bath + 7—• (2-111) 

Kb 

The energy change of the bath A//bath can be regarded as the heat Q dissipated from 
the system to the bath. Moreover, the heat Q is related to the entropy production of the 
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bath Asbath as Q/T = Asbath- Thus, we obtain 


ASbath + As 
k b 

Astot 

k B 


( 2 . 112 ) 


where As to t = Asbath + As is the total entropy production. Therefore, we automatically 


reproduce the detailed Seifert relation (2.27) and the integral Seifert relation (2.25). 


The results obtained in this section are summarized in Table 12.21 


Table 2.2: Summary of choices of the reference probability and specific meanings of the 


entropy production. 


Reference dynamics 

Reference initial state Po( x o^) 

Entropy production a 

time reversal 

canonical p eq (x T , X T ) 

dissipated work f3{W — A F) 

time reversal 

final state p T (x T ) 

total Astot/k B 

time reversal 

initial state po(x T ) 

dissipation functional hi 

dual 

initial state po(xo) 

housekeeping Ashk/fe 

time-reversed dual 

steady state p ss {x T , X T ) 

excess A0 + As ex / k B 
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Chapter 3 

Review of Information 
Thermodynamics 


In this chapter, we review thermodynamics with measurements and feedback control. 
Historically, Maxwell pointed out that thermodynamics, specifically the second law of 
thermodynamics, should break down when an intelligent being, known as Maxwell’s de¬ 
mon, controls the system by utilizing information obtained by measurements. Since then, 
numerous researches have been done on the foundation of the second law of thermody¬ 
namics [56], and thermodynamics of information processing is established m Since the 
nonequilibrium equalities reviewed in the previous chapter are generalizations of the sec¬ 
ond law of thermodynamics, the second-law-likc inequality of information processing can 
be extended to nonequilibrium equalities. 

First, we trace historical discussions on Maxwell’s demon. Then, we formulate the 
second law of information thermodynamics from a modern point of view. Next, nonequi¬ 
librium equalities of information thermodynamics are reviewed. Finally, we review exper¬ 
imental demonstrations of Maxwell’s demon. 


3.1 Maxwell’s demon 

In this section, we review historical discussions on Maxwell’s demon. 

3.1.1 Original Maxwell’s demon 

Maxwell’s demon was proposed in his book titled “Theory of Heat” published in 1871 
ca. In the second last section of the book, Maxwell discussed the “limitation of the 
second law of thermodynamics” in an example with an intelligent being, which was later 
christened Maxwell’s demon by Lord Kelvin. 
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Figure 3.1: Schematic illustration of the gedankenexperiment of Maxwell’s demon, (a) 
Second law. Initially, the left box is filled with blue (slower and colder) molecules, and 
the right box is filled with red (faster and hotter) molecules. When the window of the 
division is open, the entire system becomes uniform in temperature, (b) Function of 
Maxwell’s demon. Initially, the temperature is uniform. The demon measures the velocity 
of molecules and let the red particles go through the window from the left box to the right 
box and let the blue particles go from the right to the left. Nonuniformity of temperature 
is then achieved without work or heat exchange. 


Let us consider a vessel filled with gas molecules. The vessel is divided into two parts, 
and the division has a small window, through which a molecule passes from one side to 
the other when the window is open. When the window opens and the temperature of 
one side is different from that of the other side, the second law of thermodynamics states 


that the temperature becomes uniform (see Fig. 3.1 (a)). Maxwell’s demon achieves the 
reverse process of this phenomenon (see Fig. 3.1 (b)). At an initial time, the temperature 
is uniform throughout the vessel. The demon observes molecules in the vessel. Some 
molecules are faster than the mean velocity and others are slower because of thermal 
fluctuations. The demon opens and closes the window to allow only the faster-than- 
average molecules to pass from the left side to the right side, and only the slower-than- 
average molecules to pass from the right to the left. After some time, the demon succeeds 
in raising the temperature of the right side and lowering the temperature of the left without 
the expenditure of work. This means that the entropy is reduced in an isolated system, 
which apparently contradicts the second law of thermodynamics. In summary, Maxwell 
demonstrated that the control of the system based on the outcomes of the measurement 
can reduce the entropy of the system beyond the restriction from the second law of 
thermodynamics. 
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Figure 3.2: Protocol of the Szilard engine. A single-particle gas particle is enclosed in 
the box, which is surrounded by a heat bath at temperature T. A partition is inserted 
in the middle, and the position of the particle is measured. Based on the measurement 
outcome, we decide in which direction we shift the division. We isothermally expand the 
division to the end and remove the division. 


3.1.2 Szilard engine 

In 1929, a simplest model of Maxwell’s demon, now known as the Szilard engine, was pro¬ 
posed [13]. Although the Szilard engine is apparently different from the original Maxwell’s 
gedankenexperiment, it captures the essential features of the demon of utilizing measure¬ 
ment and feedback control to reduce the entropy of a system. Moreover, the Szilard engine 
enables us to quantitatively analyze the role of the information. 


We elaborate on the protocol of the Szilard engine (see Fig. 3.2). An ideal classical 
gas molecule is confined in a box with volume V, and the box is surrounded by an 
isothermal environment with temperature T. We insert a division in the middle of the 
box and separate the box into two parts with the same volume V/2. Then, we measure 
the position of the particle to determine whether the particle is in the left or right part. 
We assume this measurement is error-free. When we find the particle is in the left, we 
isothermally shift the division to the right end. On the other hand, when we find the 
particle is in the right, we isothermally shift the division to the left end. In both cases, 
we can extract a positive work of k^T In 2 (see the discussion below for the derivation of 
this result) from the particle in these processes. We remove the division and the system 
returns to its initial state. Therefore, we can repeatedly extract work from this isothermal 
cycle. 

Szilard pointed out that the correlation made by the measurement is the resource for 
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the entropy reduction and work extraction. The measurement process creates a correlation 
between the position of the particle x and the measurement outcome y. Let us set the 
origin of x at the middle. When x > 0 (x < 0), we obtain y = R (y = L), where R (. L ) 
means the right (left) . After the process of feedback control, namely isothermal shifting, 
this correlation vanishes because the particle can now be present in the entire system, 
so x can be positive or negative regardless of the value of y. Therefore, in the feedback 
process, we extract a positive work at the cost of eliminating the correlation between x 
and y. 

Let us quantitatively analyze this protocol from a modern point of view. By the 
position measurement, we obtain the Shannon information 


(3.1) 


I = 111 2 . 


In the process of isothermal expansion, we extract work W ext . Because we assume that 
the gas is ideal, the equation of state reads pV = k B T. Therefore, the work is calculated 
as 



(3.2) 


Therefore, we conjecture that the information obtained by the measurement can be quan¬ 
titatively converted to the work during the feedback process. 

As explained above, Szilard revealed that we can utilize the correlation established 
by the measurement to reduce the entropy of the system. He also pointed out that the 
entropy reduction achieved in the feedback process must be compensated by a positive 
entropy production during the process to establish the correlation to be consistent with the 
second law of thermodynamics of the entire process. However, it remained unexplained 
why the measurement process should be accompanied by a positive entropy production. 

3.1.3 Brillouin’s argument 

An answer to the question was presented by Brillouin in 1951 He argued, in the 

original setup of Maxwell’s demon, that the demon creates more entropy when it observes 
molecules than the entropy reduction due to his feedback control. Therefore, the obser¬ 
vation process compensates the entropy reduction of the gas, and as a result the entire 
process is consistent with the second law of thermodynamics. 

Brillouin assumed that, when the demon observes molecules, the demon needs to shed 
a probe light to molecules. However, the demon and the system are surrounded by an 
environment at temperature T with the blackbody radiation. Therefore, the energy of 
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the probe photon should be sufficiently larger than the thermal energy k B T to distinguish 
the probe from background noises. Thus, the frequency of the probe photon v satisfies 

hv >> k B T, (3.3) 


where h is the Planck constant. The demon observes a molecule by absorbing a photon 
scattered by the molecule. Therefore, the entropy production of the demon by the single 
observation is given by 

his 

AiSdemon = ^ ^B- (3-4) 

Let T l (T r ) represent the temperature of the left (right) side satisfying 

T l = T - ^AT, T r = T + iAT, (3.5) 

where AT is the temperature difference satisfying AT <C T. The demon transfers a fast 
molecule in the left box with kinetic energy ^k B T{l + ei) to the right box, and does a 
slow molecule in the right box with kinetic energy | k B T{l — 62 ) to the left box, where 
61 , e 2 > 0, and e\ and e 2 are of the order of one. As a result, heat 

Q — -jk B T{e\ + e 2 ) (3-6) 


is transferred from the left box to the right box, and the entropy reduction is bounded as 

/ 1 1 \ AT 3 AT 

- AS sys < Q ( — - —J ~ Q— = -k B (e\ + e 2 )— < k B , (3.7) 


because AT/T <g; 1 and e\ + e 2 ~ 1. Therefore, comparing this equation with Eq. (3.4), 


we conclude that the entropy production of the demon due to the measurement is far 
beyond the entropy reduction achieved by the feedback control. Thus, the second law 
remains valid for the entire system. A similar discussion can be also done for the Szilard 
engine. 

I 11 this way, Brillouin argued that the entropy production for the measurement process 
exceeds the entropy reduction during the feedback process, and therefore the total entropy 
production of the system and the demon is positive. However, his analysis depends on 
a specific model of the measurement using a photon as the probe, and the idea that the 
work gain by the feedback control is compensated by the work cost of the measurement 
is not always true. 
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Figure 3.3: Structure of a 1-bit memory. A particle is confined in a bistable potential. 
In the zero (one) state, the particle is in the left (right) well. 


3.1.4 Landauer’s principle 

Landauer argued that the energy cost is needed not for measurement processes to obtain 
information but for erasure processes of the obtained information from the memory [[IS] . 
He considered a 1-bit memory consisting of a particle in a bistable potential as shown in 


Fig. T3 We label the particle in the left well as the zero state, and the particle in the 
right well as the one state. In the erasure process, we restore the particle to the standard 
state, namely the zero state. Before the erasure, we do not know whether the particle is 
in the left or right well. Therefore, the process is a two-to-one mapping, and cannot be 
realized by a deterministic frictionless protocol. Thus, a protocol must involve a process 
with friction to erase the information. In this way, dissipation is inevitable to erase the 
information stored in the memory. 

The erasure process in which the system is brought to the zero state is logically 
irreversible because we cannot recover the state before the process from the state after the 
process. Landauer argued that logical irreversibility implies physical irreversibility, which 
is accompanied by dissipation. Therefore, logical irreversible operations such as erasure 
cause heat dissipation. 

In a 1-bit symmetric memory, where the zero and one states have the same entropy, 
the erasure from the randomly-distributed state to the standard state means the entropy 
reduction by kp, In 2. To compensate this entropy reduction, heat dissipation must occur. 
Therefore, to erase the information stored in a 1-bit memory, we have inevitable heat 
dissipation of kpT In 2. This is a famous rule known as Landauer’s principle. 

In summary, Landauer argued that the cost for erasure of the stored information 
compensates the gain in feedback processes. However, his argument is crucially dependent 
on the structure of the symmetric memory, and does not apply to general cases. In fact, 
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erasure in an asymmetric memory provides a counter-example of Landauer’s principle 

153- 


3.2 Second law of information thermodynamics 

In this section, we review the second law of information thermodynamics from a modern 
point of view. We restrict our attention to classical cases, because it is sufficient for the 
aim of this thesis. First of all, we shortly introduce classical information quantities because 
they are crucial ingredients of information thermodynamics. Then, the second law of a 
system under feedback control is discussed. After that, the second law of a memory, which 
is a thermodynamic model of the demon, is presented. Finally, we demonstrate that the 
conventional second law is recovered for the entire system. 


3.2.1 Classical information quantities 

In this section, we introduce three classical information quantities: the Shannon entropy, 
the Kullback-Leibler divergence, and the mutual information based on Refs. mm- 


Shannon entropy 

First of all, we introduce the Shannon entropy. Let x denote a probability variable and X 
denote the sample space, namely x E X. When A is a discrete set, we define the Shannon 
entropy of a probability distribution p(x) as [|] 

S X \p] — ~ lnp(:r). (3.8) 

xex 

On the other hand, when X is continuous, we would naively define the Shannon entropy 
of a probability distribution density p(x) by 


— / p(x)dxln(p(x)dx) = — / dx p(x)lnp(x) — / dx p(x)ln(dx). 


(3,9) 


Jx Jx Jx 

However, the second term is divergent in the limit of dx —> 0. Therefore, we define the 
Shannon entropy as 


S*\p] = - / dx p(x)lnp(x). (3.10) 

Jx 

1 In the context of quantum information theory, the symbol S usually denotes the von Neumann 
entropy instead of the Shannon entropy. However, in this thesis, we denote the Shannon entropy by S 
because it is the convention in the field of classical statistical mechanics. There would be no confusion 
because we do not use the von Neumann entropy in this thesis. 
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Although p{x)dx is invariant under a transformation of variable, p(x) alone is not invari¬ 
ant. Therefore, the continuous Shannon entropy as defined in Eq. (3.10) is not invariant 


under transformation of the coordinates. 
The unaveraged Shannon entropy 


s(x) = — lnp(x) 


(3.11) 


is an indicator of how rare an event x is. In fact, s(x) increases as p(x) decreases. 
The Shannon entropy is the ensemble average of this rarity. The reason why we use the 
logarithm is to guarantee the additivity of the Shannon entropy when we have independent 
events. Let us assume X = X 1 x X 2 and x = (aq,^)- Then, when p(x) = Pi(xi)p 2 (x 2 ), 
we have 


S x (p) - S Xl (pi) + S* 2 (p2). 


(3.12) 


Here, we demonstrate that the Shannon entropy is invariant under a Hamiltonian 
dynamics. In this case, X is phase space, and the dynamics is deterministic. Let x\ (xf) 


denote the initial (final) position in X and pi (pf) denote the initial (final) probability 
distribution. Since the probability is conserved, we have 


Pi(xi)dxi = pf(xf)dx{. 


(3.13) 


In addition, Liouville’s theorem states that 


dxi = dx f, 


(3.14) 


which, together with Eq. (3.13), leads to 


Pi(Xi) = Pf(Xf). 


(3.15) 


Therefore, the initial Shannon entropy 



(3.16) 


has the same value as the final Shannon entropy 

S? = - [ dx f p(x f )\np(x f ), 
J x 


(3.17) 
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namely, 


s? = s?. 


(3.18) 


Thus, the continuous Shannon (3.10) entropy is invariant in time under the Hamiltonian 
dynamics. 


Kullback-Leibler divergence 

Next, we introduce the Kullback-Leibler divergence or the relative entropy. This quantity 
is defined as a relative logarithmic distance between two probability distributions p and 
q on the same sample space X. When X is discrete, we define the Kullback-Leibler 
divergence as 


s x \pM = ~'52p(x) ln 4w- 

x&X ^ ' 


(3.19) 


On the other hand, when X is continuous, we define the Kullback-Leibler divergence as 

S X \p\\q\ = — [ dx p(x) In (3.20) 

J x P\ x ) 

We note that the continuous Kullback-Leibler divergence is invariant under a transforma¬ 
tion of the coordinates. 

Using an inequality 


1 _ 1 

p(x) ~ p(x ) 


we obtain 


S X \p\\q] > 


dx p{x) 


>x 


q(x) 

p(x) 


dx q(x) + dx p(x) 


>x 


' X 


0, 


(3.21) 


(3.22) 


where the equality is achieved if and only if p(x) = q(x) (p -almost everywhere). There¬ 
fore, the Kullback-Leibler divergence is a kind of distance to measure how different two 
probabilities are. 
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Figure 3.4: Schematic illustration of the definition of the mutual information. The two 
circles represent degrees of uncertainty of X and y. The union of the circles represents a 
degree of uncertainty of X x y. Therefore, the mutual information defined by Eq. (3.23) 
corresponds to the overlap of the degrees of uncertainty of X and jL, which describes 
correlations between X and y. 


Mutual information 

We consider the mutual information between two sample spaces X and y . Let p Xxy (x, y) 
denote a joint probability distribution of (x, y) G X x y. The marginal probability 
distributions are defined as p x (x) = f y dy p Xxy (x,y) and p y (y) = f x dx p Xxy (x, y). We 
define the mutual information as 


r*:* = S X^ X] + _ sXxy^Xxy^ 


(3.23) 


which represents the overlap of uncertainty of X and y (see Fig. 3.4). If the systems 
are independent of each other, i.e., p(x,y) = p x (x)p y (y), we obtain 1 = 0 due to the 
additivity of the Shannon entropy. Moreover, we represent the mutual information in 
terms of the Kullback-Leibler divergence as 


! x ' y — S Xxy \p Xxy \\p x p y ], 


(3.24) 


Therefore, the mutual information quantifies how different p Xxy is from the non-correlated 
probability distribution p x p y , that is, how correlated X and y are. Since the Kullback- 
Leibler divergence is not negative, we obtain 


I x - y > 0 . 


(3.25) 
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tf Zf 

Figure 3.5: Flow chart of feedback control. From time t\ to t m , the system evolves in 
time by a Hamiltonian H(z, A). At time t m , we perform a measurement on the system 
and obtain an outcome m. We conduct feedback control from time t m to tf. In other 
words, the time evolution after the measurement is governed by a Hamiltonian H(z, X m ) 
that depends on the measurement outcome m. 


The explicit form of the mutual information is 


P x (x)p y (y) 


I x ' y = - dx dyp Xxy (x,y) lm 

Jx Jy p* xy (x,y) 


(3.26) 


Therefore, we define the unaveraged mutual information as 

p x (x)p y (y) 


i(x, y) = — In 


p Xxy (x, y) 


(3.27) 


or 


i{x, y) = — In p x (x) + lnp x ^ y (x\y), (3.28) 

where p xy (x\y) is the probability distribution function of x conditioned by y. Therefore, 
the unaveraged mutual information quantifies the decrease of the unaveraged Shannon 
entropy of the system X clue to the fact that we know the system y is in a state y. 

3.2.2 Second law under feedback control 

In this section, we formulate the second law of a system under feedback control. The 
mutual information plays a crucial role in quantifying the gain of feedback control. 
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Setup 


We consider feedback control on a Hamiltonian system with phase space Z (see Fig. 3.5). 
At initial time £ i; the system is at position Z[ sampled from an initial probability distri¬ 
bution Pi(zi). From time U to f m , the system evolves under a Hamiltonian H(z, A), and 
ends up with a point z m with a probability distribution p m (z m ). At time t m , we measure 
quantities of the system (e.g. positions, velocities and the number of particles in a given 
region) and obtain a measurement outcome m. Let Af denote the sample space of the 
outcome. The detail of the measurement is modeled by conditional probabilities 


p(m\z m ), 


(3.29) 


which is the probability to obtain the outcome m under the condition that the system is 
at the position z m at time t m . Now that we have the outcome m , we know more detailed 
information on the system than we did before the measurement. In fact, the probability 
distribution of z m under the condition that we have obtained m is calculated by the Bayes 
theorem as 


Pm(z m \rn) 


P(m\z m )p m (z m ) 
p(m) 


(3.30) 


where the probability distribution of the outcome m is defined by 


p{m ) 


dz m p(m\z m )p m (z m ). 


(3.31) 


From time t m to t { , the system evolves under a Hamiltonian H(z,X m ). Here, we conduct 
feedback control of the system, that is, adjust the Hamiltonian in accordance with m via 
the control parameter X m . Therefore, the protocol after the measurement is conditioned 
by m. Let Zf denote the final position and pf(zf\m) be the final probability distribution 
conditioned by m. The unconditioned final probability distribution is calculated as 


Pffcf) = E Pf(zf\m)p(m). 

m£A4 


(3.32) 


Shannon entropy production 


We evaluate the Shannon entropy production in the above-described setup. Since Hamil¬ 


tonian dynamics does not change the Shannon entropy (see Eq. (3.18)), we obtain 


s z M = s z \ Pm }. 


(3.33) 
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By the same reason, we also obtain 


S z \p m {-\m)\ = S z \pf(-\m)\, v m G M. 


(3.34) 


To evaluate entropy production of the system, let us compare the averaged final entropy 


Sf = Y p{jn)S z \pi(-\m )] 


mgM 


with the initial entropy 


sf = s z M 


By Eq. (3.34), the hnal entropy is calculated as 


mg M 


sf = Y P(™) sZ \Pm(-\m )] 

I dz m E p(m)p m (z m \m) \np m (z m \m) 

** Z meM 

I dz m E Pm(zm,m) \np m (z m \m). 

^ Z m£M 


On the other hand, by Eq. (3.33), the initial entropy can be transformed as 


S[ — S [pm] — J dz m Pm(^'m) hlp m (z m ) 

= - dz m Y Pm(z m ,rn) \np m (z m ). 

m£M 


Using Eq. (3.28), we obtain 


Sf - S z = - f dz m Y *( 2 m,m) = -I Z:M - 
d Z meM 


(3.35) 


(3.36) 


(3.37) 


(3.38) 


(3.39) 


Therefore, the Shannon entropy production of the system is the negative of the mutual 
information obtained by the measurement. 


Second law under feedback control 

We separate the degrees of freedom 0 into that of the system x and that of the heat bath 
b as z = (x, b). Let X and B denote phase spaces of the system and the bath, respectively. 
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We assume that the total Hamiltonian reads 


H z (z, A) = H x (x, A) + H B (b) + H' m \z , A), 


(3.40) 


where the last term on the right-hand side is the interaction Hamiltonian and is assumed 
to vanish at the initial and final times. 

First, we assume that the initial state is the product state of an initial probability 
distribution of the system and the canonical ensemble of the heat bath as 


P?(z i) = P?( x i)Pe q (bi), 

where we define 


p® (b) = 


and 


e -P P B 


J db 


(3.41) 


(3.42) 


(3.43) 


In this case, since x\ and 6; are not correlated, the initial total Shannon entropy is calcu¬ 
lated as 


Sf = Sf + P{(H b ) i - F b ), (3.44) 

where the initial energy of the bath is defined as 

(H B ), = [ db i (6,). (3.45) 

Jb 

Using the final probability distribution of the total system pf(xf,6f), we define the 
marginal final probability as 

pf(x f ) = [ db { pf(xi,b { ). (3.46) 

Jb 

We calculate the relative entropy between the final state of the total system and the 
product state of the final state of the system and the canonical state of the heat bath as 

S Z [pf\\pfPeq\ = - [ dxf f dbf pf(xi,b { )\npf {x i )e~^ H ' S{bi) ~ FB) - Sf 

J x Jb 

= Sf + P{(H B ) f - F b ) - Sf. (3.47) 
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Since the relative entropy is positive, we obtain 


Sf < S? + F b ). 


(3.48) 


Therefore, comparing Eq. (3.39), we obtain 


A S* + pQ > -I Z:M , 


(3.49) 


where we identify the dissipated heat with the energy difference of the bath as 


Q = (H B ) f - (H B ) i. 


(3.50) 


Moreover, since the measurement outcome m should depend only on the degrees of free¬ 
dom of the system x m , we have 


p(m\z m ) = p(m |x m ), 


and therefore 


Thus, we obtain 




-In p z (z m ) + \np(z m \m) 

— In p M (m) + lnp(m| 2 m ) 

— In p M (m) + lnp(m|a: m ) 


AS* + PQ > ~I X:M 


(3.51) 


(3.52) 


(3.53) 


The left-hand side of Eq. (3.53) means the sum of the Shannon entropy production of the 


system and the entropy production of the heat bath. Therefore, Eq. (3.53) demonstrates 
that the total entropy production can be reduced by the feedback control by the amount 
of the mutual information I* :M . 

To proceed further, we assume that the initial state of the system is in equilibrium 
with the inverse temperature /3 given by 


Pi(? i) =Peq( X 


(3.54) 
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where we define the canonical ensemble as 


p^(x, A) = 


(3.55) 


and the free energy as 


e ~pF x (\) = f dx e -p H * {x ,X) 

Jx 


(3.56) 


The initial Shannon entropy is 

S x = -J dxip?(xi)lnp?(xi) 

= 

where we define the initial energy of the system as 

(H x ) i= [ dxipf(xi)H x (xi,Xi). 

J x 

On the other hand, since the relative entropy is positive, we obtain 


(3.57) 


(3.58) 


S X \p?(-\rn)\\p x q ('Am, { )}>0. 


(3.59) 


This relation can be rewritten as 


P I dx f pf {x i \m){H x (x u A m , f ) - F x ( A m , f )) - S x \pf(-\m)\ > 0 


tX 


X i 


?x \^X, 


lx 


Averaging this over m with the probability p(m), we obtain 


P((H x ) f - F?) > S?, 


X\ ^ cA” 


where we define the final internal energy as 


(H X )f= / dx t 2^ pf(x i , m)H x (x i ,X x J ) 

d X m&M 


(3.60) 


(3.61) 


(3.62) 


and the final free energy as 


F f x = P(m)F x { A m , f ). 


(3.63) 
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Therefore, Eq. (3.53) reduces to 


f3((AH x ) - AF x + Q)> -I X ' M . (3.64) 

Identifying the energy change of the total system (A H x ) +Q with the work W performed 
on the system, we obtain 


W — AF X > - k B TI X:M . 


(3.65) 


We can rewrite this equation as 

-W<-AF X + k B TI X:M , (3.66) 

which means that we can extract more work from the system than the conventional second 
law of thermodynamics by the amount of the mutual information I X:M obtained by the 
measurement. This form of the second law under feedback control was first formulated in 
Ref. [18] in a quantum system. 

In summary, we formulate the second law under feedback control, and reveal that the 
gain of feedback control is precisely quantified by the mutual information obtained by the 
measurement. This is a rigorous formulation of Szilard’s idea that the correlation made 
by the measurement can be utilized as a resource for the entropy reduction. 

3.2.3 Second laws of memories 

In this section, we formulate second laws of memories during a measurement process 
and during an erasure process. The second laws of memories were discussed in quantum 
systems in Ref. [19]. Here, we consider a classical version of this study. 

Measurement process 

First, we consider a measurement process. The phase space of the total system is denoted 
by Z and the system is subject to a Hamiltonian dynamics. We assume that the total 
system consists of three parts: the system, the memory, and a heat bath with phase spaces 
A, (V, and H, respectively. Phase-space positions in A, A, jk, and B are denoted by z, x, 
y , and b, respectively. Moreover, we assume that the sample space of the memory y is 
the disjoint union of y m (m 6 M. — {0, • • • , M}). An outcome m is stored when y G y m . 
Therefore, M. can be regarded as a coarse-grained sample space of y. We assume that 
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the total Hamiltonian is decomposed into 


H z (x , y , b , A) = if*(x, A) + if y (j/, A) + H B (b) + H int (x , y, b , A), (3.67) 


where the last term on the right-hand side is the interaction term, which is assumed to 
vanish at the initial and final times. 

Since the total system is a Hamiltonian system, the Shannon entropy of the total 
system is conserved: 


Sf = Sf 


(3.68) 


We assume that the initial state is a product state as 


i) = P?(xi)P? {Vi)PeM, 


(3.69) 


and then we obtain 


Sf = Sf + Sf + (3({H b ) i - F b ). 


(3.70) 


Because of the positivity of the relative entropy, we obtain 




» Sf* y + - F B ) > Sf 


(3.71) 


Substituting Eqs. (3.68) and (3.70), we obtain 


Sf xy - Sf - Sf + /3Q > 0, 


(3.72) 


where 


Q = (H B )f - (77 6 )i 


(3.73) 


Using the definition of the mutual information, we obtain 


AS* + AS y + PQ > I X:y . 


(3.74) 


Since we perform feedback control based on m, we should evaluate the entropy production 
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by l A ' M instead of I A ' y . The difference between these two values is calculated as 


rX-.y jX-.M _ 


, / , xxy ( \ i Pf(xt)pf(yt) 

dx f / dy t p f ^{Xf, yi) In -- 

Pt y (x f, 2 / f ) 


' x Jy 


'x 


(3-75) 


meVl 


where the joint probability of A and A4 is defined as 


p?** l (x u m)= dytp? x *(xt,yt). 

Jy m 


Therefore, we obtain 

- I XM = -/*<£ [ dy, V<) In 

Jx J ym Pi 

Since the right-hand side is in the form of the relative entropy, we derive 

jX-.y _ jX:M > q 


(3.76) 


(3.77) 


(3.78) 


We note that this result is natural since Ml is the coarse-grained sample space of y. Thus, 


Eq. (3.74) reduces to 


AS* + AS y + PQ > I X:M . (3.79) 

We assume that, before the measurement process, the system is in equilibrium and 
the memory is prepared to be a fixed standard state p^ t (yi), namely, a local equilibrium 
state in 33) given by 



(3.80) 


where we define 


*(9, A) = (3.81) 

and Xy m (y) i s th e characteristic function of a region y m \ the conditional free energy is 
defined by 


e -pF y ™(\) = [ dy e -PHy{y,X) 

Jy m 


(3.82) 
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The initial entropy of the memory is 


s? = wi/y, - f?). 


Using the positivity of the relative entropy, we obtain 


(3.83) 


S y \pf\\pf ei ] > 0, 


(3.84) 


where we define a reference probability by 


Pre M = X P( m )Pe<T(l/{,^n,{) 

m£A4 


(3.85) 


Therefore, we obtain 




- dyt pf(y { ) In X P(m)p^(yi,X mti ) > Sf 
^ m&M 

- X / dm Pl(yf) ln P( m )Peq(yf^m,f) > Sf 


m£M ' J y™ 


S M +/3((H y ) { - F y ) > S y , 


(3.86) 


where we use the fact that p y ™ = 0 outside y m , and the final free energy is defined with 

F? = X P(™)F ym {\ m , f). (3-87) 


mdAi 


Therefore, the entropy production of the memory is bounded as 


A S y <S M + P(A(H y ) - A F y ). 


(3.88) 


On the other hand, we can derive 


AS* = Sf-S* 


< S* + S\p*\\p* q ,]-S* 
= (3(A(H X ) - AF X ). 


(3.89) 


Using Eq. (3.74), we obtain 


/3(A (H x ) + A (H y ) - A F x - A F y + Q) > I X:M - S M . 


(3.90) 


Since the energy change of the total system A (H x ) + A (H y ) + Q should be identified as 
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work W, we conclude 


W - A F x - A F y > k B T(I x - M -S M ). 


(3.91) 


This equality presents the minimum work needed to perform the measurement, which 
may be interpreted as a rigorous formulation of Brillouin’s argument. 

Erasure process 

Next, we consider an erasure process of the stored information. We assume that the total 
system of this process consists of the memory y and the heat bath B, and the total system 
is subject to the Hamiltonian dynamics. We assume that the Hamiltonian is written as 


H yxB (y, b , A) = H y (y, A) + H B (b ) + H mt (y, b , A), 


(3.92) 


where H mt is the interaction term, which is assumed to vanish at the initial and final 
times. 


Since the total system is under Hamiltonian dynamics, we obtain 


gyxB _ gyxB 


— D f 


(3.93) 


We assume that the initial state is a product state as 



(3.94) 


and we obtain 



(3.95) 


By the same procedure to derive Eq. (3.48), we obtain 


S yxB < Sf + (3((H B ) f - F b ). 


(3.96) 


Therefore, Eq. (3.93) reduces to 


A S y + PQ> 0, 


(3.97) 


where the dissipated heat is defined as 


Q = (H B )f - (H B ) i. 


(3.98) 
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We assume that the memory initially stores a classical probability distribution p(rn) 
and it is in the local equilibrium state of y m under the condition that the stored infor¬ 
mation is m, i.e., 

P y (Vi) = P( m )Peq(yii A 0‘ ( 3 - 99 ) 

m£A4 

Therefore, the initial entropy of the memory is calculated as 

S? = / dyi ^2 p(m)p^(y u A;) In p{m’)p y ™'(y u A t ) 

m&M m'£M 

= / d yiP( m )Peq(y^ X i) ln P( m ')Peq' (VU A i) 

m^Ai m'^Ai 

= / 4 PHfi'fei ^i) ln P( m K m (3/ii ^i) 

m£M Jy™ 

= S M +P((H y ) i -F?). (3.100) 


On the other hand, by the positivity of the relative entropy, we obtain 


S? < - if), 


(3.101) 


which reduces Eq. (3.97) 


to 


f3(A(Ft y ) — AF y + Q) > S M . 


(3.102) 


We identify A (H y ) + Q as work W, since it is the energy increase of the total system. 
Therefore, we conclude 


W - AF y > kuTS’ 


■M 


(3.103) 


which reveals the minimum work needed to erase the information stored in the memory. 
When F ym (\f) = F y °A) f° r arbitrary m, we have no free-energy difference and 


Eq. (3.102) reduces to 


W > k n TS M . 


(3.104) 


In other words, in symmetric memories, Eq. (3.102) reduces to Landauer’s principle, which 


states that the minimum cost for erasure is the Shannon entropy of the stored information. 
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3.2.4 Reconciliation of the demon with the conventional second 
law 

In this section, we summarize the results obtained in the previous sections, and demon¬ 
strate that the second law is recovered in the total process [19]. 

The second law under feedback control is 

Wfb-AF* > -k B TI X:M . (3.105) 

The minus of the right-hand side quantifies the energy gain of feedback control, or the 
extractable work beyond the conventional second law thanks to Maxwell’s demon. In the 
measurement process, we have obtained 

> k B T(I XM - S M ), (3.106) 

whose right-hand side represents the additional work cost needed for the demon to perform 
the measurement. In the erasure process, we have derived 

W eias — AF e ^ as > k B TS M . (3.107) 

The right-hand side is the additional work cost for the erasure of the information stored 
by the demon. Therefore, the sum of the costs for both the measurement and erasure is 
given by 


W m+ , - AF x + , - AF^ +e 


> k B TI 


X:M 


(3.108) 


where we define W m+e = W meas + W eras and other quantities in a similar way. We see that 
the work gain by the demon is precisely compensated by the work cost that the demon 
pays for the measurement and erasure. In fact, in total, the information quantities on the 


right-hand sides of Eqs. (3.105) and (3.108) are cancelled out, and we obtain 


W M - AF* - AF* > 0, 


(3.109) 


where we define W tot = W meas + Wfy, + IlVas and other quantities in a similar way. The 


conventional second-law-like inequality (3.109) is recovered in the total process consisting 


of the measurement, feedback control, and erasure processes. In particular, in an isother¬ 


mal cycle, Eq. (3.109) reduces to Wtot A 0, which is nothing but Kelvin’s principle for the 


isothermal composite system consisting of the system and the memory. 

As reviewed in Sec. 2.1, Brillouin argued that the entropy reduction by the demon is 
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compensated by the cost for the measurement. On the other hand, Landauer’s principle 
says that the work cost for the erasure of the stored information exceeds the work gain 
of the feedback control. Although these views are true for some specific systems, they 
are not generally true. What compensates the work gain by the demon in general is 
not the individual work cost for the measurement or erasure but the joint work cost for 
the measurement and erasure processes. The information-thermodynamic inequalities 
summarized above reveal that the reconciliation of Maxwell’s demon with the second law 
is achieved in a rigorous manner. 

In terms of the Shannon entropy, we have obtained 


AS^ + (3Qr, 

> 

tX:M 

1 ? 

(3.110) 

^^meas T AS^ eas + /3Q m eas 

> 

tX-.M 

1 i 

(3.111) 

+ ft Q eras 

> 

o, 

(3.112) 

ASj* +e + A S^ +e + (3Q m+e 

> 

0, 

(3.113) 

AS* + AS& + (3Qtot 

> 

0. 

(3.114) 


The inequalities (3.111) and (3.112) are simpler than Eqs. (3.106) and (3.107) in that 
they do not have the Shannon entropy term S M explicitly. We note that the inequalities 
on the Shannon entropy (3.110), (3.111), (3.112), (3.113), and (3.114) are stronger than 
the inequalities on the work (3.105), (3.106), (3.107), (3.108), and (3.109), since the latter 
inequalities can be derived from the former inequalities based on positivity of the relative 
entropy. 


3.3 Nonequilibrium equalities under measurements 
and feedback control 

The second-law-like inequality under measurements and feedback control derived in the 
previous section can be generalized to nonequilibrium equalities as the conventional second 
law is generalized to the nonequilibrium equalities reviewed in Sec 2.1. 

In this section, we review information-thermodynamic nonequilibrium equalities. Then, 
we derive these equalities in a unified manner similar to that of Sec. 2.2. 

3.3.1 Information-thermodynamic nonequilibrium equalities 

In 2010, Sagawa and Ueda generalized the Jarzynski equality in a Markov stochastic 
system under feedback control based on a single measurement and obtained the Sagawa- 
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Ueda equality 


( e -MW-&F)-ij _ j (3.115) 

where / is the unaveraged mutual information obtained by the measurement. Using 
Jensen’s inequality, we succinctly reproduce the second law of information thermodynam¬ 
ics as 


(W) - AF >-k B T(I). (3.116) 

Later, the Sagawa-Ueda equality is generalized to Markov systems with multiple mea¬ 
surements [23], and to non-Markov systems with multiple measurements [55] . 

The Sagawa-Ueda equality has variants as the Jarzynski equality has the variants 
reviewed in Sec. 2.1. The Hatano-Sasa relation is generalized to systems under feedback 
control as [60] 


/g A0 Asex/kji /\ _ ^ 


(3.117) 


The associate inequality is given by 


(A0) + (As ex )/k B >-(/)■ 


(3.118) 


Moreover, in Ref. [60], the Seifert relation is generalized to 


Astot/k B —1\ _ 


(3.119) 


which leads to a second-law-likc inequality 


(Astot) > ~k B (I). 


(3.120) 


Reference EU derived 


( e -As hk -/^ = 


(3.121) 


and 


(Ashk) > —k B (I). 


(3.122) 


Equations (3.115), (3.117), (3.119), and (3.121) are summarized in terms of the formal 
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entropy production o as 


( e - CT - 7 ) = 1. (3.123) 

This equality is a general nonequilibrium equalities under measurements and feedback 
control. The second law of information thermodynamics is reproduced as 

W) > -(!)■ (3-124) 

3.3.2 Derivation of information-thermodynamic nonequilibrium 
equalities 

In this section, we derive the nonequilibrium equalities under measurements and feedback 
control. 

Setup 

We consider a nonequilibrium process in a classical non-Markov stochastic system with 
feedback control from time t\ to tf. To formulate the dynamics of the system, we discretize 
the time interval into N parts and define t n = ti + n(tf — ti)/N (n — 0, • • • , N). Let X and 
y denote phase spaces of the system and the memory, respectively. Moreover, we denote 
the phase-space positions in X and y at time t n by x n and y n , respectively. An external 
parameter to control the system is denoted by A, and the value of A at time t n is A n . 
Initially, the system and memory are at (x 0 , y 0 ) sampled from an initial joint probability 
distribution p 0 (xo,yo)- Then, the system evolves from time to to t\ and a measurement 
is performed at time t\ to obtain a measurement outcome y\ . After that, the system is 
subject to feedback control driven by an external parameter Ai( 2 / 1 ) from time t,\ to t 2 - In 
this way, we repeat measurements and feedback control. We define X n = (x 0 ,Xi, • • • ,x n ) 
and Y n = (y 0 , • • • ,y n ), and therefore Xn and Yn- 1 represents the entire trajectory of 
the system and memory, respectively. Since A n is adjusted based on the measurement 
outcomes before time t n , X n is a function of Y n . Therefore, the non-Markov dynamics of 
the system is determined by transition probabilities 


p(x n+ i\X n , A„(y n )). (3.125) 

On the other hand, since the measurement outcome depends on the trajectory of the 
system before the measurement, the dynamics of the memory is determined by 


p(y n \X n ). 


(3.126) 
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Therefore, the joint probability to realize Xn and Y/v-i is calculated as 


P[X n ,Y n _ r] 


N-l 

]Q p(x n+1 \X n , X n (Y n ))p(y n \X n ) -p(xi\xo, X 0 (y 0 ))po(x 0 ,yo) 

n =1 


P te [X JV |xo,Aj V -i(yjv-i)]P tr [nr-i|X iV _i]po(a:o,J/o), 


(3.127) 


where we define the transition probabilities as 


P tT [X N \x o, Atv-i(Uv-i)] 

N-l 

p(x n -|_l ^77, (^77.))? 

n-, _n 

(3.128) 

P u [Y n _ iIA^i] 

n —u 

N-l 

= I[p(yn\Xn). 

n= 1 

(3.129) 


Dividing Eq. (3.127) by P[Y N _ i], we calculate the conditional probability as 

, P tT [YN-i\X N -i]p 0 (x o, yo ) 


=p tr [x JV |A Af _,(1^—0]- 


P[Yat_iHOo) 


(3.130) 


Therefore, the conditional probability is different from the transition probability under 
measurements and feedback control. 

Following Ref. [52], we define the information obtained by the measurement at time t n 
as the mutual information between y n and X n under the condition that we have obtained 
i, that is, 


In [Vn j A n | ^ n —1 ] 


- hip(y n \Y n _ 1 ) + \wp{y n \X n , Y n _ x ) 

- \np(y n \Y n _i) + \wp{y n \X n ). 


The sum of the mutual information is calculated as 

n—1 


/ = J] 4 = ^lnP[r JV _ 1 |j/o] + lnP tr [r J v_i|X iV _i] 

P[4v-l] 


77.—1 


= — In 


(3.131) 


(3.132) 


P^iYN^lXN.MyoY 

which is the total information obtained by all the measurements. Using the total mutual 


information, we can transform Eq. (3.130) as 


P^AbvIAw-itUv-O] 

P[X N \Y N -i] 




(3.133) 
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where /; is the mutual information at the initial time defined by 


I\ — — In 


Po(x 0 )p 0 (y 0 ) 


(3.134) 


Po(x 0 ,y 0 ) 

Derivation of information-thermodynamic nonequilibrium equalities 

The formal entropy production a is defined as the ratio of the reference probability to the 


original probability under a fixed protocol as in Eq. (2.32). Therefore, we obtain 


P 1 [Xn | Ay -1 (Yn -i ) ] = e - o -pq v |A J v- 1 0'jv- 1 )] 
^[XjvlAiv-r^iv-r)] 

We define the joint probability of the reference process as 


(3.135) 


P r [ Wv,Yv—r] = P^A^l^-l^P^-l], 


(3.136) 


which means that we sample the reference process conditioned by Y ^_i with the same 


probability P[Y N _ i] as the original process. From Eqs. (3.127) and (3.136), the ratio of 
the reference joint probability to the original joint probability is calculated as 


PVCnjYn-i 

P[X n ,Y n _ i 


P r [-^~AT | Ajv_ ! (T/V— t )] P[Y N -l}p o(xq) 
P tr [- 5 fv|Ai V - 1 (hiv- 1 )] P tr [h) V - 1 |- 5 fiv-i]po(^o, yo) 


- (3.137) 


= e 


Multiplying both sides by P[Xn, Ev-i] and integrating over Xn and Y/v-i, we obtain 


-°-i-h\ - 


) = 1 , 


(3.138) 


because the reference probability is normalized to unity. In ordinary feedback protocols, 
the system is assumed not to be correlated with the memory at the initial time. Therefore, 


Eq. (3.138) reduces to 


( e_f7_/ ) = I- 


(3.139) 


The appropriate choices of the reference probability explained in Sec. 2.2 reduce Eq. (3.139) 


to Eqs. (3.115), (3.117), (3.119), and (3.121) 


3.4 Experiments 

In this section, we briefly review experimental demonstrations of Maxwell’s demon. 
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Figure 3.6: (a) Two washboard potentials induced by an electromagnetic field. If 

the Brownian particle climbs up the potential due to thermal agitation, the potential is 
switched to the other potential to prevent the particle from descending, (b) Free-energy 
gain of the Brownian particle subtracted by the amount of work done on it. The abscissa 
represents the delay time e it takes to switch the potential after the position measurement. 
The first two data points show the net free energy gain beyond the conventional second 
law of thermodynamics. Reproduced from Figs. 2b and 3d in Ref. [16] • Copyright 2010 
by the Macmillan Publishers Limited. 


The first experimental realization of Maxwell’s demon was done by Toyabe et al. [IB] . 
They demonstrated that the free energy of a Brownian particle can be increased by feed¬ 
back control based on measurements of the position of the particle. A couple of polystyrene 
beads are suspended in a water with one of them anchored to a glass plate. The other 
bead can move on a ring, and is subjected to a washboard potential created by four 


electrodes (see Fig. 3.6 (a)). At a certain instant in time, the position of the particle is 
measured. After a delay time e, if the particle climbs up the potential due to thermal 
agitation, the potential is switched to the other potential to prevent the particle from 
descending; otherwise the potential is left unchanged. This protocol of feedback control 
is repeated. As a result, the particle is able to gain free energy larger than the work done 
on it in this feedback-controlled process (see Fig. 3.6| (b)), namely, it was demonstrated 
that information obtained by the measurements can be used as a resource for free energy. 

The information-thermodynamic nonequilibrium equality (3.115) was experimentally 
verified in Ref. im A feedback-controlled two-state system similar to the Szilard engine 
was implemented in a single-electron box (SEB) illustrated in Fig. 3.7 (a). The gate 


voltage V g in Fig. 3.7 (a) is adjusted so that the minimum charging energy is achieved 
when the average number n of the electrons that have tunneled from the left island to 
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Figure 3.7: (a) Schematic illustration of the system demonstrating the information- 

thermodynamic nonequilibrium equality (3.115). The single-electron box (SEB) consists 
of two metallic islands connected by a junction, through which electrons are transported 
by tunneling. The number n of excess electrons in the right island is monitored by a single- 
electron transistor (SET). The ground-state average value n g of n is controlled by the gate 
voltage V g . (b) Experimental verification of Eq. (3.115). The abscissa represents the error 
rate of the measurement of n. The data shown by black squares deviate significantly from 
unity, showing the breakdown of the Jarzynski equality, whereas the red squares show 
that Eq. (3.115) holds. Reproduced from Figs. 1 (a) and 3 (b) in Ref. [17] . Copyright 
2014 by the American Physical Society. 


the right one is n g = 0.5. The temperature of the system is low enough that the SEB is 
in either the n = 0 or n = 1 state. Therefore, the state with n = 0 is realized with the 
probability of 0.5, so is the state with n — 1. The state of the SEB is monitored by a 


single-electron transistor (SET) (see Fig. 3.7 (a)), and the feedback control is conducted 
by changing V g based on the value of n to extract work from the SEB. The average in 


Eq. (3.115) was experimentally confirmed to be unity within experimental errors as shown 


in Fig. 3.7 (b). 
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Chapter 4 

Nonequilibrium Equalities in 
Absolutely Irreversible Processes 


As discussed in Chap. 2, nonequilibrium equalities apply to rather general nonequilibrium 
situations. However, it is known that integral fluctuation theorems are inapplicable to 
some situations. We propose a new concept of absolute irreversibility as a novel class of 
irreversibility that encompasses the entire range of those situations to which conventional 
integral nonequilibrium equalities cannot apply. In mathematical terms, the absolute 
irreversibility is defined as the singular part of the reference probability measure, and 
is uniquely separated from the ordinary irreversible part by Lebesgue’s decomposition 
theorem [25, [2B]. We derive nonequilibrium equalities that are applicable to absolutely 
irreversible processes based on measure theory. Inequalities derived from our nonequi¬ 
librium equalities give a positive-definite lower bound of the entropy production when a 
process involves absolute irreversibility. 

First of all, we consider free expansion to illustrate that absolute irreversibility causes 
inapplicability of conventional nonequilibrium integral equalities, and define absolute ir¬ 
reversibility in terms of measure theory. Then, we derive nonequilibrium equalities in 
absolutely irreversible processes based on Lebesgue’s decomposition theorem. Next, we 
verify our nonequilibrium equalities in several examples. Finally, we compare our method 
with a conventional method and discuss merits of ours. 

In this chapter, we restrict our attention to systems without measurements and feed¬ 
back control. This chapter is mainly based on Ref. E3- 
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Figure 4.1: (a) Forward process of free expansion. An ideal single-particle gas is ini¬ 

tially in a local equilibrium in the left box with temperature T. Then, the partition is 
removed, and the gas freely expand to the entire box. (b) Backward process of free ex¬ 
pansion. Initially, the single-particle gas is in a global equilibrium of the entire box with 
temperature T. Then, the partition is inserted, and the gas particle is in either the left 
or right box. The backward path ending in the right box (indicated by the blue arrow) 
has no corresponding forward path. Therefore, this is a singular path with a negatively 
divergent entropy production. Reproduced from Fig. 1 of Ref. 122] Copyright 2014 by 
the American Physical Society. 


4.1 Inapplicability of conventional integral nonequi¬ 
librium equalities and absolute irreversibility 

In this section, we introduce absolute irreversibility in an example of free expansion, to 
which the Jarzynski equality cannot apply. Then, we mathematically define absolute 
irreversibility in terms of measure theory. 


4.1.1 Inapplicability of the Jarzynski equality 


The Jarzynski equality is known to be inapplicable to free expansion [ 211 122] , Here, we 
illustrate this fact. Suppose that an ideal single-particle gas at temperature T is prepared 
in the left side of a box with a partition as illustrated in Fig. 4.1 (a). Then, we remove the 


partition and let the gas expand to the entire box. In this process, work is not extracted 
(W = 0), whereas the free energy decreases (A F < 0). Therefore, the dissipated work is 
always positive: 


W — AF > 0. 


(4.1) 
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Thus, we have 


,-/3(W-A 


F) ) < i, 


(4.2) 


which means that the Jarzynski equality (2.4) is not satisfied in this process [21]. In 
physical terms, this is because the Jarzynski equality assumes that the initial state is 
in a global equilibrium and this assumption is not satisfied in the present case. Recall 
that, in free expansion, the initial state is not a global equilibrium state, but only a local 
equilibrium state [22]. Therefore, the Jarzynski equality cannot apply to free expansion. 
Then, it is natural to ask why a local equilibrium state cannot be assumed as an initial 
condition of the conventional integral nonequilibrium equality. 

The mathematical reason is that we have paths with a divergent entropy production 
when we start from a local equilibrium state. This statement is illustrated in free expan¬ 
sion as follows. We consider a set of virtual paths {Tr} starting from the right box. By 
assumption, the probability of these paths in the forward process vanishes: P[Tr] = 0 . 
On the other hand, the probability of the corresponding backward paths is nonvanishing: 

7 ^ 0. Therefore, we have 


! r, p[r; = o & p’lr*; ^ o. 


(4.3) 


For these paths, the entropy production is negatively divergent in the context of the 


Crooks fluctuation theorem ( 2 . 12 ) because 


«r = P(W - AF) = - In -> -oo 


(4.4) 


and 


e~ a = 


pt[Tt] 

~W\ 


—>■ oo. 


(4.5) 


Due to the paths with a divergent entropy production, the conventional integral nonequi¬ 
librium equality breaks down: 


e~ a ) = [ e- a V[T]VT 

Jv[ r]^o 


P t [T t ]PT t 


Jv[ r]^o 

= 1- f V ] \T ] \DY ] 
Jv[T]=0 

< 1 . 


(4.6) 
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Therefore, we conclude that the negatively divergent entropy production of the paths 
starting from the region in which the initial probability vanishes is what makes the con¬ 
ventional integral nonequilibrium equality inapplicable to the process starting from a local 
equilibrium state. 


This situation [Eq. (4.3)] makes a stark contrast to ordinary irreversible processes, 
where every backward path has the corresponding forward path with a nonvanishing 
probability: 


v r, pt[U] ± o => v[t] ± o, 


(4.7) 


or 


'r, p(r; = o => -p , (r t ] = o. 


(4.8) 


Therefore, in the ordinary irreversible case, the exponentiated entropy production remains 
finite 


pt [r t] 

e — < co, 


p[r] 


(4.9) 


and the thermodynamic irreversibility is quantitatively characterized by the entropy pro¬ 
duction. Although these paths are thermodynamically irreversible, they are stochastically 


reversible in a sense of Eq. (4.7), namely, every backward path has the nonvanishing for¬ 


ward counterpart. In contrast, if the condition (4.3) holds, there exist backward paths 


that have no counterparts in the forward process, which means that these paths are not 
even stochastically reversible. Therefore, we shall call these paths absolutely irreversible 
paths, and call the processes with absolutely irreversible paths absolutely irreversible 
processes. 

Mathematically, probability theory is based on measure theory, and the ratio /V is 
interpreted as the transformation function of the two probability measures. Therefore, we 
need measure theory to formulate absolute irreversibility. We thus give a mathematical 
definition of absolute irreversibility. 


4.1.2 Definition of absolute irreversibility 

Let A4 [XT] denote the probability measure of the original process. This is a generaliza¬ 
tion of the description in terms of the probability density which is written as V\Y\DY. 
Let the probability measure of the reference process be denoted by J\A r [DY] which is a 
generalization of V r \Y\DY. According to Lebesgue’s decomposition theorem [25, [25], the 
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AT ac + 





Figure 4.2: Schematic illustration of Lebesgue’s decomposition theorem. The abscissa 
represents coordinates of phase space and the ordinate shows the probability density. Ver¬ 
tical lines represent 5-function-like localization. The reference probability measure AT 
(blue solid curve) is decomposed into two parts with respect to the original probability 
measure Af (dashed curve). The probability ratio is well-defined in the absolutely contin¬ 
uous part, whereas it diverges in the singular part. Note that the 5-function-like reference 
probability measure is absolutely continuous if its singular part coincides with that of the 
original probability measure as shown in the middle figure. Reproduced from Fig. 2 of 
Ref. [2?]. Copyright 2014 by the American Physical Society. 


reference probability measure can be uniquely decomposed into two parts as 


AT 


M r AC + M 




(4.10) 


where A4 ac and Alg are absolutely continuous and singular with respect to A4, respec¬ 
tively (see Fig. 4.2). The absolute continuity of A1 ac guarantees that the probability 
ratio is well-defined due to the Radon-Nikodym theorem [25j [26 J as 


c 

DM 


(4.11) 


which is an integrable function with respect to AT In physical terms, it is this ratio 
that gives the entropy production. Therefore, in measure theory, the Crooks fluctuation 
theorem reads 


P-Mac - g 
DM 


(4.12) 


On the other hand, the probability defined by Alg takes a nonzero value in the region 
where the probability defined by A4 vanishes. Therefore, the ratio of Afg to A4 is di¬ 
vergent, and we cannot define a finite entropy production through this ratio. Thus, in 
physical terms, A4g corresponds to the absolutely irreversible part. We therefore iden¬ 
tify M\ c as the ordinary irreversible part and A4g as the absolutely irreversible part. 
If AT S does not vanish, the conventional integral nonequilibrium equality breaks down 
[ZU 1221 [621 63J. See Appendix B for the mathematical definitions of absolute continuity 
and singularity, and a mathematical statement of Lebesgue’s decomposition theorem. 


68 
















Figure 4.3: Schematic illustration of the stronger version of Lebesgue’s decomposition 
theorem. The reference probability measure AT (blue solid curve) is decomposed into 
three parts with respect to the original probability measure A4 (dashed curve). In the 
absolutely continuous part (bottom left), the probability ratio of A4l c to A4 is well- 
defined. In the singular continuous part (bottom middle), the probability ratio of A4g C to 
A4 is divergent because the probability density of A4 vanishes and that of A4g C remains 
nonvanishing. In the discrete part (bottom right), the probability ratio of A4 d to A4 is 
divergent because the probability density of M d is divergent whereas A4 remains finite. 
We note that A4 does not involve (5-function-like localization, which is the assumption of 
the stronger version of Lebesgue’s decomposition and needed for the uniqueness of this 
decomposition (see Appendix B for more detail). Reproduced from Fig. 3 of Ref. [2?]. 
Copyright 2014 by the American Physical Society. 


When M can be written in terms of a probability density, a stronger version of 
Lebesgue’s decomposition theorem holds. In this case, AT is decomposed into three 
parts as 


M' = Mlc + Ml c + m;„ 


(4.13) 


where M.\ c and Alg C are absolutely continuous and singular continusous with respect 
to M, respectively, and M. x d is the discrete part of AT (see Fig. 4.3). The singular 


continuous part Ai r sc corresponds to the region where the probability ratio is divergent 
because the denominator, which is the forward probability, vanishes. This term represents 
the effect of free expansion. The discrete part J\A d has (5-function-like localization, and the 
probability ratio is divergent because the numerator, which is the reference probability, 
diverges. This part arises when particles can localize and do not undergo thermal diffusion; 
such a situation occurs when there are trapping centers of particles. In this way, the 
absolute irreversibility is classified into two categories. The correspondence between the 


classification of irreversibility and that of probability measure is summarized in Table 4.1 
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4.2 Nonequilibrium equalities in absolutely irreversible 


processes 

In this section, we derive nonequilibrium equalities applicable to absolutely irreversible 


processes based on Lebesgue’s decomposition theorem (4.10) and (4.13). 


First, we derive general nonequilibrium equalities in absolutely irreversible processes. 
Then, we show that they reduce to several individual nonequilibrium equalities with their 
specific meanings of the entropy production by proper choices of the reference probability 
distribution as described in Sec. 2.2. 

4.2.1 General formulation 


First, we derive nonequilibrium equalities based on Lebesgue’s decomposition theorem (4.10) 


Let J-’fr] denote an arbitrary functional of a path T, and let (•••), (• • -) r and (• • •)}’ (I=AC, 
S) denote the average over A4, A4 r and A4j, respectively. From Lebesgue’s decomposition 


theorem (4.10), we obtain 


(■O' = (Oao + (Os- 


(4.14) 


On the other hand, using the Radon-Nikodym derivative (B.8), we evaluate the average 


over the absolutely continuous part in terms of the entropy production as 


<^> 


AC 


= Mr] 


T[T}M\ c [VT] 
VM\ C 


VM 

= I F[T]e~ a M[Vr\ 

= <^o, 


M[DY] 


(4.15) 


Table 4.1: Correspondence between the classification of irreversibility and that of prob¬ 
ability measure. 


Class of irreversibility 

Ordinary 

Absolute I 

Absolute II 

Class of measure 

absolutely continuous 

singular continuous 

discrete 

Exponentiated entropy: e~ a 

2) =8mte 
pm 

V r [T 

0 

m ^ 
m ~ 
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where we use the Crooks fluctuation theorem (4.12) to obtain the third line. Therefore, 


by Eqs. (4.14) and (4.15), we obtain 


(^O = (^) r - (JOs, 


(4.16) 


which can be regarded a generalization of the master fluctuation theorem (2.35). If we 
set T to unity, we obtain 


(O = 1 - As, 


(4.17) 


where 


As = / M\[DT ] 


(4.18) 


is the probability of the singular part. We note that As is uniquely defined because of 


the uniqueness of Lebesgue’s decomposition (4.10). As explained in the previous section 


and summarized in Table 4.1[ the absolute irreversibility has two classes. Therefore, Ag 
is calculated as the sum of two contributions of these classes of absolute irreversibility. 
One is the probability of those reference paths whose corresponding original paths have 
vanishing probability. The other contribution is the probability of localized reference 
paths. To the best of our knowledge, the localized contribution had not been considered 
before our research [27j. However, this part renders the conventional nonequilibrium 
equalities inapplicable. Namely, only when Ag = 0, we reproduce (e _cr ) = 1. 

Using Jensen’s inequality (e _cr ) > e - ^, we obtain 


(a) > — ln(l - A s ). 


(4.19) 


Therefore, the second-law-like inequality (a) > 0 is valid even with absolute irreversibil¬ 


ity, because the right-hand side of Eq. (4.19) is nonnegative. Moreover, if there exists 


absolute irreversibility Ag > 0, then Eq. (4.19) imposes a stronger restriction on the en¬ 


tropy production than the conventional inequality because the right-hand side will then 
be strictly positive. Thus, the average of the entropy production must be strictly positive 
in absolutely irreversible processes. 

When the stronger version of Lebesgue’s decomposition holds, by following the same 
procedure as before, we obtain 


(e a ) = 1 — A sc — Ad, 


(4.20) 

(4.21) 
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where (•••)[ (i=sc, d) represents the average over A4[, and A sc and Ad are the probabilities 
of the singular continuous part and the discrete part, respectively. We can calculate A sc 
as the sum of the probabilities of those reference paths whose corresponding counterparts 
in the original process vanish. On the other hand, Aa is calculated as the sum of localized 


reference paths. From Eq. (4.21), we obtain an inequality 


(a) > — ln(l - A sc - A d ), 


(4.22) 


which leads to a fundamental lower bound on the entropy production in absolutely irre¬ 
versible processes. 


4.2.2 Physical implications 


We apply the general results in the previous section to processes starting from a restricted 
region. 

Here, we consider time reversal as the reference dynamics. In accordance with Ta¬ 


ble 2.1, in a Langevin or Hamiltonian system, Eq. (4.12) reduces to 


e -' 7 = e~ 0Q 


l, 


AC 


dylig 


d/i f 


0,AC 


r 0 


d/x l 


(4.23) 


where we rewrite the boundary term in terms of measure theory and V represents the 
configuration coordinates in the case of an overdamped Langevin system and the phase 
space coordinates of the system and heat bath in the case of a Hamiltonian system. Here, 
/i 0l /ij and /iL represent the initial probability measure of the original process, the initial 
probability measure of the time-reversed process and the Lebesgue measure on the phase 
space fi, respectively, and Hl,ac is the absolutely continuous part of /il with respect to 
/r 0 and ^ 0AC is the absolutely continuous part of /xj with respect to /xl- 


Generalization of the Jarzynski equality 

Now, let us assume that the initial state of the original process is a local equilibrium state 
which is restricted to a region Dq C as 

/i 0 (dr 0 ) = e ^w( r o,Ao)-^) XDo (r 0 ) AiL (dro), (4.24) 


where yn 0 (ro) is the characteristic function defined by 


XD 0 (r 0 ) 


i r 0 g Dq 
o r 0 £ Dq , 


(4.25) 
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and the free energy is defined by 


e _/3F ° = [ e-P H{r °’ Xo) n L (dr 0 ). 

Jd q 

In this case, the absolutely continuous part of with respect to /x 0 is 

hL,Ac(rfr 0 ) = XD O (r 0 )n L (dr 0 ). 


(4.26) 


(4.27) 


Therefore, we obtain 


d ^,AC = e /3(H(T 0 ,\ 0 )-F 0 ) 

d do r 0 


(4.28) 


On the other hand, we set the initial probability measure of the time-reversed process to 
a local equilibrium distribution in a region D T as 



= e -0(H(x, 


’^-^WiVK^ib 


(4.29) 


which is already absolutely continuous with respect to /iL, namely, /ig AC = /i J,. Therefore, 
we obtain 


°^o, AC 
dii l 


= e Ft) xd t { V t)- 


iy 


Then, substituting Eqs. (4.28) and (4.30) into Eq. (4.23), we obtain 


(4.30) 


e 


—a 


e- m+ * h -* f ) XDt (T t ) 

e-^ W - AF W(r r ), 


(4.31) 


where we use the first law of thermodynamics to obtain the last equality. Thus, Eq. (4.17) 
reduces to 


{e -f3{W-A F) X D T (r T )) = l-\ S . 


In particular, if we set D r to O, we obtain 


{e -m-A f )) = 1 _ As , 


which is a generalization of the Jarzynski equality (2.4) 


(4.32) 


(4.33) 
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Generalization of the Seifert relation 


Here, we assume that the initial probability distribution of the original process is abso¬ 
lutely continuous with respect to the Lebesgue measure, and therefore we have 


/i 0 (dr 0 ) = p 0 (r 0 )/iL(dr 0 ). (4.34) 

Let D 0 denote the support of p 0 . Then, the absolutely continuous part of /il with respect 
to /io is 


hL,Ac(rfr 0 ) — XD 0 (^o)Mdr 0 ). 


(4.35) 


Therefore, the Radon-Nikodym derivative is 


dppAC _ X£>o(r0) 
dpo r 0 Po( r o) ' 


(4.36) 


We set the initial probability distribution of the reference process to be equal to the final 
probability distribution of the original process: /i\ = p T . Then, the Radon-Nikodym 
derivative 


dpo,AC 

dp l 


dp T , AC 

dp l 


r T 


Pr(r r ) 


(4.37) 


represents the (unnormalized) final probability distribution of the original process. There¬ 


fore, Eq. (4.23) reduces to 


_ ~/3Q Pr(r r ) /p \ 

^ /-p \ \Dq ft 0/ 

po(ro) 


_ g^Asbath/fcBp-As/fcB 


xn o (r 0 


(4.38) 


Thus, Eq. (4.17) reduces to 


(e _Astot ) = 1 - A s , (4.39) 

where we use XD 0 (r 0 )po(ro) = Po(ro) because D 0 is the support of po- This is a general¬ 
ization of the Seifert relation (|2.25[). 


Up to here, we set the reference dynamics to the time-reversed dynamics, and have shown 


that our nonequilibrium equality (4.17) with absolute irreversibility derives generalized 


integral nonequilibrium equalities (4.33) and (4.39) involving the entropy production with 
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Table 4.2: Correspondence between reference probabilities, specific meanings of the 
entropy production and nonequilibrium equalities with absolute irreversibility. 


Reference dynamics 

Ref. ini. state 

Entropy production 

Nonequilibrium equality 

time reversal 

canonical 

dissipated work 

{ e - m - afj } = i _ As 

time reversal 

final state 

total 

( e - Ast ° t/fcB ) = 1 - A s 

time reversal 

initial state 

dissipation functional 

(e- n ) = 1 - A s 

dual 

initial state 

housekeeping 

(e' AWfcB ) = 1 - A s 

time-reversed dual 

steady state 

excess 

= i _ Ag 


the heat bath. We note that our nonequilibrium equality (4.17) also derives generalized 
integral fluctuation theorems for the housekeeping and excess entropy production in a 
similar manner under the proper choices of the reference dynamics as summarized in 
Table 14721 


4.3 Examples of absolutely irreversible processes 

In this section, we verify our nonequilibrium equality in three examples: free expansion, 
an overdamped Langevin process starting from a local equilibrium, and an overdamped 
Langevin system with a trapping center. 


4.3.1 Free expansion 


First of all, we discuss the case of free expansion (see Fig. 4.1 (a)). Initially, an ideal single¬ 
particle gas is confined in the left box with temperature T. The entire box is assumed 
to be divided in the volume ratio l : 1 — l by a partition. Since the initial state can be 
described by the probability density, the stronger version of Lebesgue’s decomposition 


holds. Thus, Eq. (4.33) reduces to 


{e -KW-AF)) = i _ Agc _ Ad . 


(4.40) 


We remove the partition, and the gas expands to the entire box. In this process, work is 
not extracted: W = 0, whereas the free energy of the gas decreases due to the expansion 


by A F = k B Tlnl(< 0). Therefore, the left-hand side of Eq. (4.40) is calculated to be 


(e -«IV-A F)) = ; 


(4.41) 


We consider the time-reversed process of this free expansion to calculate the absolute 
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irreversible probabilities (see Fig. 4.1 (b)). In the time-reversed process, the system is 
in a global equilibrium at the initial time. Then, the partition is inserted at the same 
position as the original process. The gas particle is in either the left or the right box. 
The paths ending in the right box have no corresponding forward paths in the original 
process. Therefore, these paths are singular continuous paths. The probability of these 
paths is proportional to the volume fraction of the right box, and therefore the singular 
continuous probability is 


Asc = 1 — I (4.42) 

On the other hand, since there is no discrete path, which is a single path with a finite 
positive probability, we have 


Ad = 0. 


(4.43) 


Therefore, we obtain 


1 A ac A c i — /. 


(4.44) 


Thus, our nonequilibrium equality (4.40) is verified for the case of free expansion of an 
ideal single-gas particle. 


4.3.2 Process starting from a local equilibrium 

Next, we consider an overdamped Langevin process starting from a local equilibrium 
state. The Langevin particle is confined in a one-dimensional ring. The potential consists 
of n identical harmonic potential wells with the same stiffness (spring constant) k(t ) as 


illustrated in Fig. 4.4 (a). Initially, the system is prepared in a local equilibrium state with 
temperature T in a given well, and therefore the initial probability vanishes elsewhere. 
We subject the system to a nonequilibrium process during a time interval r. We decrease 
the stiffness of the potentials from k = K to 0 at a constant rate between t = 0 and r/2, 
and then increase it from k = 0 to n 2 K at a constant rate between f = r/2 and r. Since 
the initial state can be written in terms of the probability density, the stronger version of 


Lebesgue’s decomposition holds. Therfore, Eq. (4.33) reduces to 


{e -m-AF }) = 1 _ Xgc _ Ad . 


(4.45) 


Now, we consider the time-reversed process to calculate the singular probabilities. 
Initially, the system is in a global equilibrium at temperature T with stiffness k = n 2 K. 
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Figure 4.4: (a) Schematic illustration of an overdamped system consisting of n identical 

harmonic potentials confined in a one-dimensional ring. In the original process, the system 
is in a local equilibrium in a given well at the initial time, (b) Probability density of 
work performed on the system for several n values. The triangles indicate the points of 
W = kp,T In n. (c) Values of (e~ im ) at each n. Superimposed is the 1/n curve (with 
no free parameters), (d) Values of (/3W) at each n. The line represents the minimum 
dissipation given by Eq. (4.50). The parameters are chosen as follows: diffusion constant 
D = 10 -13 m 2 /s; temperature T = 300K; duration of the process r = lOsec; half width 
of a single potential a = 10 _6 m; the initial stiffness of the potential K is chosen to 
satisfy Ka 2 /2 = 5/cbT. The statistical average is obtained from 10 6 samples for each n. 
Reproduced from Fig. 4 of Ref. 122]- Copyright 2014 by the American Physical Society. 
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The stiffness is decreased from k = n 2 K to 0 between t = 0 and r/2, and then increased 
from k = 0 to K between t — r /2 and r. Because a backward path terminates in a 
certain well with probability 1 jn due to the symmetry of the potential and the initial 
state of the time-reversed process, the probability that the backward path does not have 
the corresponding forward path is 

77 — 1 

Asc = --• (4.46) 

n 

Moreover, since we have no discrete path, we obtain 


A d = 0. 


(4.47) 


Therefore, Eq. (4.45) reduces to 


( e 


-/3(W-AF)\ _ 


> = ;■ 


(4.48) 


If we assume that K is sufficiently large, A F = 0 in this process. Thus, we obtain 

(e-P w ) = -. (4.49) 

n 

The corresponding inequality reads 


(f3W) > In n. 


(4.50) 


We obtain the probability distributions of W at different n by numerical simulations 
as shown in Fig. 4.4 (b). Based on this probability distribution, the value of (e _/3H/ ) is 
obtained and confirmed to be 1/n (see Fig. 4.4 (c)). We also verify that the average 
dissipation (/3W) is larger than the minimum dissipation given by Eq. (4.50), namely, the 
fundamental lower bound due to the absolute irreversibility as demonstrated in Fig. |4.4 


(d). We note that this process may be regarded as an information erasure of an n-digit 
memory. 


4.3.3 System with a trap 


Finally, we consider an overdamped Langevin system with a trap. The system is one¬ 
dimensional, and the Langevin particle is confined in a single harmonic potential with 
stiffness k{t). We assume that there is a trapping point in the system and the distance 
between the center of the harmonic potential and the trapping point is denoted by x c 
as illustrated in Fig. 4.5 (a). If the particle reaches the trapping point, it is trapped 
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with unit probability. Initially, the system is prepared in an equilibrium of the harmonic 
potential. We subject the system to a nonequilibrium process by changing stiffness k. 
The stiffness is decreased from k = K to 0 at a constant rate between t — 0 and r/2, and 
then increased from k = 0 to K at a constant rate between t = r/2 to r. Since the initial 
probability can be written by the probability density, Eq. (4.39[) reduces to 


(e 


—As t , 


s ) — 1 — A sc — Ad 


(4.51) 


To evaluate the singular probabilities, we consider the time-reversed process. The 
initial state of the time-reversed process is set to the final state of the original process. 
Let p trap and p\ rap denote the trapping probabilities of the final state of the original and 
time-reversed processes, respectively. Because the particle trapped in the original process 
will remain trapped in the entire time-reversed process, the probability of this single path 
is Ptra P (> 0). Therefore, the discrete probability is 

Ad = Ptrap ■ (4.52) 

Moreover, since the paths that fall into the trap in the time-reversed process have no cor¬ 
responding counterparts in the original process, they are singular continuous. Therefore, 
the singular continuous probability is 

Asc = Ptrap — Ptrap- (4.53) 


Thus, Eq. (4.51) reduces to 


( e 


— As t i 


') = 1 ~P\ 


rap i 


(4.54) 


which leads to an inequality 


(As to t) > - ln(l 


Ptrap) 


(4.55) 


This inequality is automatically satisfied because the left-hand side is positively divergent 
due to the presence of those paths that fall into the trap in the original path with a 
positively divergent entropy production. 

Figure [475] (b) shows the probability density of the total entropy production. Based on 
this probability distribution, the exponentiated average of the total entropy production 


is calculated and confirmed to be consistent with Eq. (4.54) as demonstrated in Fig. 4.5 
(c). 
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Figure 4.5: (a) Schematic illustration of a one-dimensional overdamped system with 

a trapping center. In the original process, the system is in an equilibrium of the har¬ 
monic potential, (b) Probability density of the total entropy production. Note that 
this probability density is not normalized over M because we have a positively diver¬ 
gent entropy production for the trapped paths. The triangles indicate the points with 
As tot = _ ln(l— pj rap ). (c) Values of (e~ Astot ) versus the trapping probability pj rap . Super¬ 
imposed is the 1 — pj rap line (with no free parameters). The parameters D and T are the 


same as in Fig. 4.4 The distance between the center of the harmonic potential and the 


trapping point is x c = 10 _6 m. The initial stiffness K is set so as to satisfy Ka 2 /2 = IO^b^. 
The duration of the process r is varied between lsec to lOOsec to change the trapping 
probability. The statistical average is taken over 10 6 samples for each r. Reproduced 
from Fig. 5 of Ref. [27]. Copyright 2014 by the American Physical Society. 
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Figure 4.6: Virtual process corresponding to free expansion. Initially, the single-particle 
gas is in a global equilibrium state; the particle is in the left box with probability l and 
in the right box with probability 1 — l. Then, the partition is removed. Reproduced from 
Fig. 8 of Ref. [2?]. Copyright 2014 by the American Physical Society. 


4.4 Comparison with a conventional method 


In this section, we review a conventional method [64] to compare with our method based 
on absolute irreversibility. 

As an illustration, we rederive our nonequilibrium equality in free expansion, namely, 


Eq. (4.40) by using the conventional method. We consider a virtual process starting from 


a global equilibrium state with temperature T as shown in Fig. |4.6| Namely, the particle 
is in the left box with probability l and in the right box with probability 1 — l. Therefore, 
the probability of paths starting from the right box does not vanish, and absolutely 


irreversible paths do not exist. Thus, the conventional nonequilibrium (2.17) applies as 


«-Ur]e 


-«IV-AFh. _ 


)> = «-F[r]»t 


(4.56) 


where .F[r] is an arbitrary functional, and ((• • •)) denotes the statistical average of the 
virtual process, and the caret ~ means that the accompanying quantity is the one in the 
virtual process. In this simple case, we can easily obtain relations between the physical 
quantities in the virtual process and the corresponding quantities in the original process 
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as 


w = 

w, 

(4.57) 

F 0 = 

F 0 + k B T \n l, 

(4.58) 

F T = 

Ft i 

(4.59) 

A F = 

A F - k B T In /. 

(4.60) 


To derive a nonequilibrium equality in the original process, we set J-"[r] to the charac¬ 
teristic functional whose value is equal to one only if the path starts from the left box 
and zero otherwise. Then, considering the normalization of probabilities properly, we can 
express the average in the original process in terms of that in the virtual process as 

«JTT].--» = /<-••>. (4.61) 

In the time-reversed process, the probability of paths ending in the right box is l, and 
therefore we obtain 


«.F[r]»t = /. 


(4.62) 


Therefore, Eq. (4.56) reduces to 


{ e - m - af)) = 


(4.63) 


which agrees with Eq. (4.40) derived from the nonequilibrium equalities with absolute 


irreversibility. Following a similar procedure, we can also rederive Eqs. (4.45) and (4.51) 
in principle. 

Let us examine the meaning of this conventional procedure. First, we extend the initial 
probability distribution to the global canonical distribution of the system to circumvent 
the problem of the vanishing probability. Then, we derive the nonequilibrium equality 


(4.56) in this artificial process. Next, we express physical quantities of the artificial process 


by those of the original process as in Eqs. (4.57) and (4.60). Finally, we erase some paths 


irrelevant to the original dynamics by setting T to the characteristic function, and relate 


the average of the virtual process to that of the original process as in Eq. (4.61). 


In this way, the conventional method must introduce the artificial process to derive 
the nonequilibrium equality to avoid the problem arising from absolute irreversibility. In 
contrast, our method directly deals with the original process. Moreover, the reason why 


the right-hand sides of Eqs. (4.40), (4.45), and (4.51) deviate from one is clearer in our 


method, that is, the probability of absolutely irreversible paths should be subtracted from 
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one. Incidentally, we note that although Eq. (4.51) can in hindsight be derived by the 
conventional method, we naturally find this absolutely irreversible example by virtue of 


the stronger version of Lebesgue’s decomposition (4.13). 
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Chapter 5 


Information-Thermodynamic 
Nonequilibrium Equalities in 
Absolutely Irreversible Processes 


In this chapter, we generalize the nonequilibrium equalities in the presence of absolute ir¬ 
reversibility obtained in Chap. 4 to situations under measurements and feedback control. 
This generalization is of vital importance when error-free measurements are preformed, 
because they project the probability distribution of the measured state onto a localized re¬ 
gion in the phase space. The subsequent time evolution of this confined post-measurement 
state, in general, involves expansion into an initially unoccupied region, which provides 
yet another example of absolute irreversibility. The generalized nonequilibrium equalities 
under measurements and feedback control enable us to identify the unavailable informa¬ 
tion, which results from the inevitable loss of information that we cannot utilize under 
a fixed feedback protocol. The unavailable information provides a fundamental limit of 
performance of the feedback protocol. 

First, we derive information-thermodynamic nonequilibrium equalities in the presence 
of absolute irreversibility. Next, we introduce a notion of unavailable information and de¬ 
rive a different type of nonequilibrium equalities that involve the unavailable information. 
Finally, we demonstrate our nonequilibrium equalities in several examples. 

This chapter is partly based on Refs. mm- 

5.1 Infor amt ion-thermodynamic equalities 

In this section, we derive information-thermodynamic nonequilibrium equalities in the 
presence of absolute irreversibility. This section is mainly based on Ref. m 

Under feedback control, it is important to consider the effect of the singular part of 
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the reference probability measure discussed in Sec. 4.1.2 because high-precision measure¬ 
ments such as error-free measurements localize the probability distribution. Since the 
feedback control starts from this post-measurement state confined in a narrow region, the 
subsequent time evolution involves expansion into an initially unoccupied region unless 
the feedback protocol is fine-tuned, and therefore the process exhibits absolute irreversibil¬ 
ity. In experiments, since we have access to only a few parameters, the fine-tuning is a 
difficult task in general. Hence, absolute irreversibility has experimental relevance in a 
system under measurements and feedback control. 

We consider a nonequilibrium process in a classical system with measurements and 
feedback control from time t — f; to if as in Sec. 3.3.2. Let x(t) and y(t) denote phase- 
space points at time t of the system and the measurement outcomes, respectively. The 
external control parameter A(t) is determined based on the measurement outcome before 
f, namely, {^/(s)}* =t .. Let A", Y and A[F] denote the entire paths of the system, the 
measurement outcomes and the control parameter, respectively. Moreover, let M\y denote 
the conditional probability measure of X under given measurement outcomes Y, and let 
M V [Y] denote the reference probability measure of X under a given feedback protocol 
A[K]. We apply Lebesgue’s decomposition theorem to .MWyi with respect to M\y, and 
obtain 


M] m — -Ad^qAjy] + *Ms|A[y]> 


(5.1) 


where the first and second terms on the right-hand side give the absolutely continuous and 
singular parts of the reference probability measure, respectively. Lebesgue’s decomposi¬ 
tion theorem ensures that this decomposition is unique. Let J~[X, Y] denote an arbitrary 
path functional. It follows from Eq. (5.1) that 


— {J~) AC|A[y] + (^SIAIYp (5-2) 

where (- - - )i|yv[y] (I = 0, AC, S) denotes the average over Moreover, the average 

over the absolutely continuous part can be transformed by the Radon-Nikodym derivative 
because 


<^> 


AC|A[Y] 


nx,Y]M' Kcm px] 

VM C , 


= j nx, n 


l AC|A[Y] 


DM 


\Y 


M\ Y [DX\ 


x 


(5.3) 
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where (• ■ ■ )|y denotes the average over M\y, and we define 


e 


-R 


ZWaciam 

X>M. |y 


X 


(5.4) 


Therefore, we obtain 


= (7) 


|A[y] w )s|A[y]- 


(5.5) 


If we set J 7 to unity, we obtain 


(e i? )|y = 1 - A S |a[y], 


(5.6) 


where 


As|a[v] = / Ml\ K[Y] [VX\ 


(5.7) 


is the probability of the singular part of the reference probability measure conditioned by 

A[n 


Here, we consider the physical meaning of R defined in Eq. (5.41 1 The Crooks fluc¬ 


tuation theorem (4.12) applies to the ratio of the transition probabilities under a fixed 
protocol: 


e 


—a 


VM\ my] 

wTvi 




(5.8) 


where Alf'Tyi is the transition probability under the fixed protocol A[E], Therefore, 
R is different from a because the conditional probability is different form the transition 
probability as discussed in Sec. 3.2.2. The difference is represented by the Radon-Nikodym 
derivative as 


—a — (j 

e=e 


VM 


\Y 


X 


(5.9) 


Comparing this with Eq. (3.133), we find that the Radon-Nikodym derivative in this 
equation may be written as e -/ ~ 7i , where / is the total mutual information obtained by 


1 Here, we assume that Af^ C |A[i'] * s absolutely continuous with respect to M j^y] and that 
is absolutely continuous with respect to M.\y- Otherwise, we cannot define the entropy production 
and mutual information individually, because their values are divergent and mathematically ill-defined. 
However, the sum of these quantities remains finite due to the absolute continuity of Afkc|A[v] 
respect to M.\y- 














the measurements and /; is the initial correlation between the system and the measurement 
outcomes. Therefore, we have 


R — CT + / + /(, 


(5.10) 


and Eqs. (5.5) and (5.6) reduce to 




Y 


(e-^OlY 


(Film w )s|A[y]j 

1 - A S |A[K], 


respectively. Averaging these over measurement outcomes Y, we obtain 

(Re-*- 1 - 1 ') = (RY-(T)l, 

<■s -°- I - h ) = 1-As, 


(5.11) 

(5.12) 


(5.13) 

(5.14) 


where As is the average over Y of the A[Y]-conditioned singular probability As|a[v]- Equa¬ 


tion (5.14) derives a second-law-like inequality as 


(a) > -(/ + /;) - ln(l - A s ). 


(5.15) 


Therefore, the lower bound of the entropy production is determined not only by the 
mutual information but also by the term arising from the absolute irreversibility. If the 


feedback protocol is so poorly designed that — ln(l — As) > (I + Jj), Eq. (5.15) implies 


that the entropy production is positive and the feedback protocol does not work. In the 
case of vanishing initial correlations, we obtain 


(e-’- 1 ) = 1 — As. 

(a) > —(/) - ln(l - A s ). 


(5.16) 

(5.17) 

(5.18) 


Moreover, if the conditioned initial states A4\y satisfy the assumption of the stronger 


version of Lebesgue’s decomposition (4.13), we obtain 


(Re-*- 1 ) = (R) 1 - (R)l c - (R) T d , 


(e ) = 1 - A sc - A d , 

(a) > -(I)- ln(l - A sc - A d ), 


(5.19) 

(5.20) 

(5.21) 


where the subscripts “sc” and “d” represent the singular continuous and discrete parts, 
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respectively. 

In the same manner as the case without feedback control, proper choices of the refer¬ 
ence probability lead to nonequilibrium equalities with specific meanings of the entropy 


production as listed in Table 4.2 and we obtain the corresponding equalities 


/ e -P(W-AF)-I\ 

= 1 — As, 

(5.22) 

A s tot//cB —I 

= 1 — As, 

(5.23) 


= 1 — A s , 

(5.24) 

( e ~ A S h k /fc B -^ 

= 1 — As, 

(5.25) 

e ~A<f>-As^/k B -I\^ 

= 1 — As, 

(5.26) 


under the proper assumptions described in Sec. 2.2. 


5.2 Unavailable information and associated equali¬ 
ties 

In this section, we define a concept of unavailable information based on the information- 
thermodynamic nonequilibrium equalities obtained in the previous section, and derive new 
nonequilibrium equalities that involve the unavailable information. Inequalities derived 
from the new equalities give an achievable lower bound of the entropy production in the 
case of error-free measurements. 


First of all, we point out that the equality of Eq. (5.18) cannot be achieved in general 


even in the quasi-static limit. The equality condition of Jensen’s inequality ( e~ x ) > e~^ 
is that the quantity x does not fluctuate. On the other hand, in the quasi-static limit, 
a is expected to have a single definite value. Therefore, in situations without feedback 
control, the equality in the inequality 


(ct) > — ln(l - A s ) 


(5.27) 


is achieved. In contrast, the equality in the inequality under feedback control 


(a) > -(/) - ln(l - A s ) 


(5.28) 


cannot be achieved in general even in the quasi-static limit due to fluctuations of /, that is, 
I depends on the measurement outcomes unless all outcomes are obtained with the same 


probability. Hence, Eq. (5.28) gives only a loose lower bound of the entropy production, 


although it is tighter than the conventional inequality ((a) > —(I)) 





To find an achievable lower bound of the entropy production, we start from Eq. (5.12). 
If we assume that we have no initial correlations, Eq. (5.12) reduces to 


(e a 7 )|y — 1 — As|A[yp (5.29) 

From Jensen’s inequality, we obtain 


(o")|f > ~{I)\y ~ ln(l — A S |A[r])- 


(5.30) 


What is noteworthy on this inequality is that the equality is achievable under error-free 
measurements in the quasi-static limit, because / reduces to the unaveraged Shannon 
entropy of Y, which has a single definite value under fixed measurement outcomes Y. 


Therefore, the right-hand side of Eq. (5.30) gives an achievable lower bound of the entropy 


production for a fixed Y. Hence, the last term in Eq. (5.30) represents the inevitable 


dissipation due to the incompleteness of the feedback protocol A[Y]. In other words, 
the feedback protocol A[Y] cannot fully utilize the mutual information obtained by the 
measurements. Thus, we define unavailable information in the protocol A[Y] as 


Ai|A[y] — — ln(l — A S |a[y]) (> 0). 


(5.31) 


We note that, under error-free measurements and in the quasi-static limit, we have 


cr = — I + I u (error-free,quasi-static). 


(5.32) 


Using Eq. (5.31), we rewrite Eq. (5.29) as 


p — /+iu\, _ 


}|F = I- 


(5.33) 


Averaging this equality over Y, we obtain a new nonequilibrium equality: 


(e 


-cr—I-\-I u \ _ 


>=i. 


(5.34) 


This equality leads to 




(5.35) 


We note that this inequality is stronger than Eq. (5.28) due to the convexity of the 
logarithmic function — In x, that is, 


{In) ~ (— l n (l — A S |a[y])) > — hi(l — As). 


(5.36) 









Furthermore, Eq. (5.35) gives an achievable lower bound of the entropy production under 


error-free measurements and in the quasi-static limit because of Eq. (5.32). Therefore, we 


can quantitatively characterize the incompleteness of the feedback protocol by calculating 
the unavailable information. 

Here, we assume that the initial state is an equilibrium state, and set the reference 


probability to the time-reversed one starting from an equilibrium state. Then, Eqs. (5.34) 


and (5.35) reduce to Jarzynski-type equations as 


/-«F-AF)-7+/ u \ = 1; 

-(IE) < -A F + k B T(I-I u ), 


(5.37) 

(5.38) 


respectively. Equation (5.38) gives an achievable upper bound of extractable work in the 


feedback process. The bound is reduced due to the unavailable information compared 
with the conventional result 


The other choices of the reference probability summarized in Table 2.1 also give 
nonequilibrium equalities with their individual meanings. 


5.3 Examples of absolutely irreversible processes 

In this section, we verify the information-thermodynamic nonequilibrium equalities de¬ 
rived in the previous sections. Here, we only consider relations with dissipated work, 
namely, 


/ e -0(W-AF)-Jv = 

1 — As, 

(5.39) 

(W) - A F > 

—k B T(I) — k B T ln(l — A s ), 

(5.40) 

e -P(W-AF)-I+I u \ = 

1, 

(5.41) 

(W) - A F > 

-k B T(I-I u ). 

(5.42) 


First, we consider a measurement and subsequent trivial feedback control. Second, 
we discuss the two-particle Szilard engine. Finally, the multi-particle Szilard engine is 
considered. 


5.3.1 Measurement and trivial feedback control 

We consider a measurement and the subsequent feedback control. Initially, an ideal single- 
particle gas is in a global equilibrium state of the box as illustrated in Fig. 5.1 (a). At 


a certain time, we perform an instantaneous error-free measurement to determine the 
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X=L X=R X=L X=R 


Forward 


Backward 


Figure 5.1: (a) A measurement and the subsequent trivial feedback control. Initially, 

an ideal single-particle gas is in a global equilibrium state of the entire box. At a certain 
time, we perform an error-free position measurement to determine whether the particle is 
in the left or right side. After the measurement, we do nothing as the feedback control, 
(b) The time-reversed protocol. Initially, the particle is in the global equilibrium of the 
entire box and we do nothing regardless of the outcome of the forward process. At the 
time of the measurement of the forward process, the particle is stochastically in the left 
or right side. For the outcome X = L (R), the case in which the particle is in the right 
(left) side is singular (blue arrows). Reproduced from Fig. 6 of Ref. [27], Copyright 2014 
by the American Physical Society. 


position A" of the particle. An outcome X = L is obtained when the particle is in the left 
side, that is, the length from the left-end wall is shorter than the length of the whole box 
multiplied by l (0 < l < 1). On the other hand, an outcome X = R is obtained when the 
particle is in the right side. In both cases, we conduct trivial feedback control, i.e., we 
leave the system as it is. Therefore, the gas expands to the entire box. 

In this process, work is not extracted: W = 0, and the free energy does not change: 
A F = 0. Let p(L) and p( R) denote the probabilities to obtain outcomes L and R, 
respectively. If the measurement outcome is X = L, we obtain mutual information / = 
— lnp(L) = — In/. If X = R, we obtain mutual information / = — lnp(R) = — ln(l — /). 

Next, we consider the time-reversed process to find the singular probabilities. In the 
time-reversed process, we do nothing for both A" = L and R, because we do nothing in the 
forward process except for the measurement. The time-reversed process corresponding to 


the outcome X = L is illustrated in the left half of Fig. 5.1 (b). The particle is in the 
left side with probability l and in the right side with probability 1 — /. The case of the 
particle being found in the right side has no corresponding forward event under A" = L, 
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and therefore it is a singular event. Therefore, we obtain 


A 


S|A[L] 


= 1 - 1 . 


(5.43) 


On the other hand, in the time-reversed process corresponding to the outcome X = R 


illustrated in the right half of Fig. 5.1 (b), we obtain 


'S|A[R] 


= /. 


(5.44) 


Therefore, the unavailable information is obtained as 

fu|A[L] = — bi(l — A S |a[l]) = — hi/, 

7 u |A[r] = - br(l - A S |A[R]) = - ln(l - /). 


(5.45) 

(5.46) 


We note that the unavailable information coincides with the mutual information in this 
case, because all the information is lost since we do nothing as feedback control. The 
values of the physical quantities are summarized in Table |5.1[ 


The left-hand side of Eq. (5.39) is calculated as 


(g-Wr-AFW) = p( L ) • e ln * + p(R) • e ln ^ 

= / 2 + (l-/) 2 . 

On the other hand, the averaged singular probability is 


(5.47) 


As = p(L)As|a[l] + p(R)As|a[r] 

= 2 /( 1 -/). 


(5.48) 


Therefore, by a simple calculation, we can verify Eq. (5.39). Next, let us verify Eq. (5.40). 
The average of the dissipated work is 


(W) - AF = 0. 


(5.49) 


Table 5.1: Values of physical quantities corresponding to the outcomes L and R. 


X 

P(X) 

W 

A F 

/ = -lnp(X) 

As|a[x] 

7 u |A[x] — — ln(l — As|a[x]) 

L 

l 

0 

0 

-In / 

l-l 

-In / 

R 

l-l 

0 

0 

- ln(l - /) 

l 

- ln(l - /) 
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The right-hand side of Eq. (5.40) is 


- k B T(I) - k B T ln(l - A s ) = k B T[l Inl + (1 - l) ln(l - l) - In(Z 2 + (1 - /) 2 )].(5.50) 


By analytic calculation, we can show that the right-hand side of Eq. (5.50) is nonpositive, 
that is, 


k B T[l In l + (1 -l) ln(l - l) - In(Z 2 + (1 - /) 2 )] < 0, 


(5.51) 


and it is zero if and only if / = 1/2. Thus, Eq. (5.40) is verified. Moreover, the necessary 
and sufficient condition for the equality is l = 1/2, which is consistent with the observation 


made in Sec. 5.2, namely, the equality in the inequality (5.28) is achieved only when all 
outcomes are obtained with the equal probability. 


Now, we verify Eqs. (5.41) and (5.42) in the presence of the unavailable information. 


The left-hand side of Eq. (5.41) is calculated as 


{ e -^W-AF)-I+I U) = p(L) . glnZ-In/ +p(R) . e ln(l-I)-ln(l-Q = ^ 


(5.52) 


which verifies Eq. (5.41). Moreover, because I u — I in this case, the right-hand side of 


Eq. (5.42) is 


- k B T(I - I u ) = 0. 


(5.53) 


Therefore, we obtain 


W — AF = —k B T(I — J u ). 


(5.54) 


This equality means that the equality in the inequality (5.42) is achieved regardless of the 


value of /, which is what we expect because Eq. (5.42) gives an achievable bound. 


5.3.2 Two-particle Szilard engine 

Next, we consider the two-particle Szilard engine. The reason why we do not consider the 
single-particle Szilard engine as in Refs. PHI 23| is that the singular part does not arise 
in the single-particle Szilard engine since we can fully utilize the information obtained 
by the measurements. Therefore, we should consider a Szilard engine with two or more 
particles to observe effects of the absolute irreversibility. We begin by the two-particle 
Szilard engine. 


Initially, two identical ideal-gas particles are confined in a box as illustrated in Fig. 5.2 
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Figure 5.2: (a) Two-particle Szilard engine. Initially, an ideal two-particle gas is in a 

global equilibrium of the entire box. Then, we insert a partition in the middle of the box 
and perform an error-free measurement to find the number n of the particles in the right 
box. Depending on the outcome, we isothermally and quasi-statically shift the partition to 
such a position that the extracted work is maximal. Finally, we remove the partition, (b) 
Time-reversed protocol for each n. The partition is inserted and then shifted in accordance 
with the value of n. We have singular paths without the corresponding forward paths 
in the case of n — 1 indicated by blue arrows. Reproduced from Fig. 7 of Ref. ra. 
Copyright 2014 by the American Physical Society. 
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We insert a partition in the middle of the box. Then, we perforin an error-free measure¬ 
ment to find the number n of the particles in the right side. If n = 0, we isothermally 
and quasi-statically shift the wall to the right end to extract work, and then remove the 
partition. If n = 2, we shift the wall to the opposite direction to obtain the same amount 
of work, and then remove the partition. If n = 1, we just remove the wall because we 
cannot extract work by shifting the partition. For all n, the system returns to the initial 


state after these protocols. The values of extracted work are summarized in Table 5.2 


We consider the time-reversed process to obtain the singular probabilities (see Fig. 5.2) 


Initially, the two particles are in a global equilibrium regardless of n. When n = 0, we 
insert the partition in the right end of the box. Then, we isothermally and quasi-statically 
shift the partition to the middle of the box and remove the partition. Because this path 
has the counterpart in the forward process, we have no singularity when n = 0: 


As|A[o] = 0. 


(5.55) 


Similarly, we have no singular part when n = 2: 


As|A[2] = 0. 


(5.56) 


Now, let us consider the time-reversed process when n = 1. In this case, we insert the 
partition in the middle of the box and then remove it. The probability of the two particles 
being found in the left box has nonvanishing probability, so does the probability of both 
particles being found in the right box. However, we have no corresponding forward paths 
in the protocol of n = 1. Therefore, these time-reversed paths are singular, and the 
singular probability is 


^S|A[1] = 


1 

2 ' 


(5.57) 


Table 5.2: Values of physical quantities corresponding to the outcome n. 


n 

p{n) 

W 

A F 

I = — In p(n) 

•^S|A[n] 

-^u|A[n] lll(l ^S|A[n]) 

0 

1 

4 

— 2/cbT1ii2 

0 

In 4 

0 

0 

1 

1 

2 

0 

0 

In 2 

1 

2 

In 2 

2 

1 

4 

—2/cbT1ii2 

0 

In 4 

0 

0 
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Thus, the unavailable information is 


Ai|A[0] = 0, 

Ai|A[ 1] = bl 2, 

Ai| A [ 2 ] = 0. 


(5.58) 

(5.59) 

(5.60) 


Now, we are ready to verify our nonequilibrium equalities. The left-hand side of 


Eq. (5.39) is calculated as 


^-PiW-AFyij = p . g2/Jfc B T In 2—In 4 + p ^ . g-ln2 + . ^k B T In 2-ln4 = ^ ( 5 . 61 ) 

On the other hand, the average of the singular part is 


As — p(0)A S |a[o] +p(1)As|a[i] +p(2)A S |a[2] = -• 


(5.62) 


Therefore, Eq. (5.39) is verified. The left-hand side of Eq. (5.40) is 


(W)-AF = -k B T In 2. 


(5.63) 


On the other hand, the right-hand side of Eq. (5.40) is 


(I) -ln(l-As) = -ln-^=. 


Hence, we obtain 


(5.64) 


(IE) - A F> -k B T{I) - k B T ln(l - A s ), 


(5.65) 


and Eq. (5.40) is verified. We note that the equality in the inequality (5.40) is not achieved 


because the probability of finding a particular value of n varies for each n. 


The left-hand side of Eq. (5.41) is calculated as 


( e -/3(W-A F)-/+/ U ) = p (Q) . e 2/?fc B Tln2—In 4 + . e -ln2+ln2 + ^ In 2-ln 4 = ^ (5 gg) 


which verifies Eq. (5.41). The right-hand side of Eq. (5.42) is 


— k B T (I — Ai) = —k B T In 2. 


(5.67) 
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Thus, we obtain 


(W) — AF — —k B T(I — J u ), 


(5.68) 


verifying Eq. (5.42). Because Eq. (5.42) gives an achievable bound of work in the quasi¬ 


static limit, the equality in Eq. (5.42) is achieved. 


5.3.3 Multi-particle Szilard engine 

We generalize the two-particle Szilard engine to the multi-particle case. Initially, an ideal 
IV-particle gas is in a global equilibrium of the entire box. We insert a partition in the 
middle of the box. Then, we measure the number n of the particles in the right box. 
Then, we isothermally and quasi-statically shift the partition to the position that divides 
the entire box according to the ratio (N — n ) : n. Finally, we remove the partition. The 
probability of the outcome n being found is 

P(n) = ( 5 - 69 ) 

where nCu represents the number of combination that we choose n objects from N objects. 
Therefore, the information obtained by the measurement is 

J(n) = In 2 — In jvC n . (5.70) 

The work during the process can be calculated as 

W = -nk B Tln ^ - (N - n)k B Tln 2(yN ~ ^ 

Tl (N — 71 ) 

= -nk B T In — - {N - n)k B TIn K J - Nk B T In 2. (5.71) 

Since the system returns to the initial state, we have 

A F = 0. (5.72) 

We calculate the singular probabilities by considering the time-reversed process, and ob¬ 
tain 

As|A[„] = l -*<?„(-) (-jy-J . (5.73) 
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and 


Ti N — Tl 

All a [n] = - In nC u - n In — - (N - n) In . 


The left-hand side of Eq. (5.39) is 




N 
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n =0 
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On the other hand, the averaged singular probability is 

N 

As = ^p(n)\s\A[n] 

n =0 
N 

= X>( 


in 


n =0 


i n I n Y ( N - n 
1 NCn[ N) { N 


N-n 


, {nCu) 2 / n\ n f N — n 

~ ^ 2 N \N 


n =0 


N 


Therefore, Eq. (5.39) is verified. 


By a simple calculation, we obtain 


W - AF = -k B T(I - I u ). 


(5.74) 


(5.75) 


(5.76) 

(5.77) 

(5.78) 


(5.79) 


ffence, Eqs. (5.41) and (5.42) are automatically satisfied. Moreover, the equality in 


Eq. (5.42) is achieved, because the measurement is error-free and the process is quasi¬ 
static. 


Large-A" limit 

We consider the laige-N limit of the Wparticle Szilard engine. The fluctuation of the 
particle number n in the right box after the partition is inserted is of the order of a /N. 
Therefore, the dominant contributions to the average of physical quantities come from 
events with 



(1 + x), x = O 
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(5.80) 




















We consider approximate formulae in this range of n. 
The logarithm of the probability p(n) is calculated as 


In p(n) = In N\ — Inn! — ln(iV — n)\ — N\n2. 
By using the Stirling formula 


1 N ( 1 

In TV! = - ln(27riV) + Nln — + O ( — ) , 


we obtain 


(5.81) 


(5.82) 


IN ( 1 

In pin) = - In ^ —— + N In N — n In n — (N — n ) In (AT — n) — N In 2 + O f — 

= l ln T jV(l 1 2 -P) - f (1 + x) 111(1 + x) - 1 (1 - x) ln(1 - x) + 0 (jf 

= -In — - — X 2 + O ( — 

2 nN 2 \N 


Therefore, we obtain 


N „ , 

f i M 

— —a: 2 + O 

— 

2 ' 



P( n ) = \l ^ ex P 


and 


T/ 1 71N N 2 / 1 

I(n) = -\n— + - x +ol- 


In a similar manner, the unavailable information is calculated as 

1 


*u|A[n] 


1 In N 

2 11 2im(N — n) 

1 7T N ( 1 

= - ln-1 - O [ — 

2 2 \N 


+ ° l N 


The average of the mutual information is evaluated as 

N 

(I) = ^P(n)l(r 


(n 


n =0 

N 

= — / p(n)I(n)dx 

1 . 77 N 1 / 1 

= - In-1-h O — . 

2 2 2 \N 


(5.83) 


(5.84) 


(5.85) 


(5.86) 


(5.87) 
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Therefore, the conventional second law of information thermodynamics 


(W) -A F> - k B T(I) (5.88) 

reveals that the effect of feedback control is sub-extensive and of the order of In N. 

On the other hand, the average of the unavailable information is obtained as 

1 nN ( 1 \ 

(« = 2>»- + 0( F ). (5.89) 

Therefore, the amount of the available information is 

(I~h) = \ + 0 (T) . (5.90) 

Thus, our inequality 

(W) - A F> -k B T(I - J u ) (5.91) 

limits the effect of feedback control to the order of unity, which is a qualitatively new 
restriction compared with that of the conventional second law of information thermody¬ 
namics. The same behavior was derived by another method [65] based on the Kawai- 
Parrondo-Van den Broeck equality [66], EZj. 


100 


Chapter 6 

Gibbs’ Paradox Viewed from 
Absolute Irreversibility 


In this chapter, we apply our nonequilibrium equalities in the presence of absolute irre¬ 
versibility to the problem of gas mixing. This problem is what Gibbs’ paradox concerns 
[29]. Gibbs’ paradox is qualitatively resolved once we realize that there are many entropies 
pm E23 ■ The most prevalent quantitative resolution based on quantum mechanics is in 
fact irrelevant to Gibbs’ paradox |31[ [32] , Another well recognized resolution is based 
on the extensivity of the thermodynamic entropy [ 33l l32 j. However, this resolution is 
valid only in the thermodynamic limit, and cannot deal with any sub-leading effects. We 
propose a new quantitative resolution of Gibbs’ paradox based on our nonequilibrium 
equalities. 

First of all, we review the original Gibbs’ paradox and later qualitative discussions. 
Then, we present two widespread quantitative resolutions. Finally, we resolve Gibbs’ 
paradox from the viewpoint of absolute irreversibility. 

6.1 History and Resolutions 

In this section, we review a brief history of Gibbs’ paradox and discuss its resolutions. 
We first review the original discussion and interpretation given by Gibbs. Next, we 
review works by later researchers that clarify Gibbs’ interpretation and argue that Gibbs’ 
paradox is qualitatively resolved even in classical theory. Then, we discuss the quantum 
resolution of Gibbs’ paradox, which is the standard resolution of Gibbs’ paradox. Finally, 
we examine yet another resolution given by Pauli. 
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6.1.1 Original Gibbs’ paradox 


First of all, we review Gibbs’ original discussion which appeared in his writing titled “On 
the Equilibrium of Heterogeneous Substances” [2SJ. We consider mixing of two different 


gases (see Fig. 6.1 (a)). The gases are confined in a box partitioned into equal halves. 


Initially, an ideal IV-particle gas of one kind is confined in the left side with temperature T, 
and an ideal IV-particle gas of the other kind is in the right side with the same temperature. 
Then, we remove the partition and the two gases expand to the entire box. Let S(T, V, N ) 
denote the thermodynamic entropy of an ideal gas with temperature T, volume V and the 
number of particles N. The entropy production of this mixing process can be calculated 
by the Clausius definition of the thermodynamic entropy. The entropy production of one 
gas is given by 


S(T, 2V, N) - S(T , V,N) = - 


"(T,2V,N) 


' (T,V,N) 


5Q_ 
T ’ 


( 6 . 1 ) 


where Q denotes the heat transferred from the system to a heat bath, and the integral is 
conducted along an arbitrary virtual quasi-static process that connects the state (T, V, N ) 
to (T, 2V, N). We set the virtual process to the quasi-static isothermal process. Due to 
the first law of thermodynamics and the fact that the internal energy of an ideal gas does 
not change in an isothermal process, we have 


6Q — — pdV ., 


( 6 . 2 ) 


where p is the pressure of the gas. Therefore, we obtain 

r2V pdV 
~T r ' 


S(T, 2V, N ) - S(T, V, N ) = 


'v 


Using the equation of state pV = Nk-^T, we obtain 


r- 2V 


dV 


S(T , 2V, N) - S(T, V, N) = Nk B \ — = Nk B In 2. 


>v 


(6.3) 


(6.4) 


Due to the additivity of the thermodynamic entropy, the entropy production of two dif¬ 
ferent gases is the sum of their individual entropy productions. Therefore, the entropy 
production of the mixing of two different gases is given by 


AS dii = 2Nk B In 2. 


After Gibbs derived Eq. (6.5), he argued 


(6.5) 
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Figure 6.1: (a) Mixing of two different gases. Initially, an ideal iV-particle gas is confined 
in the left side and another ideal iV-particle gas of the other kind is confined in the right 
side. Then, we remove the partition, and the two gases expand to the entire box. (b) 
Mixing of two identical gases. Initially, two ideal IV-particle gases of the same kind are 
confined both in the left and right sides. Then, we remove the partition, and the gases 
expand to the entire box. 


“It is noticeable that the value of this expression does not depend upon the 
kinds of gas which are concerned, if the quantities are such as has been sup¬ 
posed, except that the gases which are mixed must be of different kinds. If 
we should bring into contact two masses of the same kind of gas, they would 
also mix, but there would be no increase of entropy.” 


In fact, when we consider mixing of two identical gases (see Fig. 6.1 (b)), the initial 
entropy of the system is given by 


2 S(T,V,N) (6.6) 

due to the additivity of the thermodynamic entropy. On the other hand, the final entropy 
is 


S{T,2V,2N), 


(6.7) 


which is equal to the initial entropy (6.6) due to the extensivity of the thermodynamic 
entropy: 


S(T,qV,qN) = qS(T,V,N), 


( 6 . 8 ) 


where q is an arbitrary positive real number. Therefore, the entropy production of the 
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mixing of two identical gases is 


A S id = 0, 


(6.9) 


which is different from the entropy production of the mixing of two different gases (6.5), 


although Eq. (6.5) is “independent of the degrees of similarity or dissimilarity between 
them.” In particular, if we consider 


“the case of two gases which should be absolutely identical in all the proper¬ 
ties (sensible and molecular) which come into play while they exist as gases 
whether pure or mixed with each other, but which should differ in respect to 
the attractions between their atoms and the atoms of some other substances,” 


the thermodynamic entropy increases in the mixing process of two different gases, although 

“the process of mixture, dynamically considered, might be absolutely identical 
in its minutest details (even with respect to the precise path of each atom) 
with processes which might take place without any increase of entropy,” 

i.e., with processes of the mixing of identical gases. This paradoxical consequence of the 
gas mixing is referred to as Gibbs’ paradox. 

To explain this fact, Gibbs stressed, 

“if we ask what changes in external bodies are necessary to bring the system 
to its original state, we do not mean a state which shall be undistinguishable 
from the previous one in its sensible properties. It is to states of systems thus 
incompletely defined that the problems of thermodynamic relate.” 

In other words, the thermodynamic entropy is not a property of a microstate, but a 
property of a set of microstates indistinguishable from each other, i.e., a property of a 
macrostate. In this way, “entropy stands strongly contrasted with energy.” Therefore, 

“the mixture of gas-masses of the same kind stands on a different footing from 
the mixture of gas-masses of different kinds,” 

This explanation given by Gibbs was clarified by later researchers as we see in the next 
section. 


6.1.2 Gibbs’ paradox is not a paradox 

In this section, we review discussions by three researchers, and argue that Gibbs’ paradox 
is not a paradox even within classical statistical mechanics. 
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Grad |30j 

Grad discussed Gibbs’ paradox in the introduction of his article titled “The Many Faces 
of Entropy” [30]. In the introduction, he emphasized, 

“A given object of study cannot always be assigned a unique value, its ‘en¬ 
tropy.' It may have many different entropies, each one worthwhile.” 

and argued, 

“much of the confusion in the subject is traceable to the ostensibly unifying 
belief (possibly theological in origin!) that there is only one entropy.” 

In fact, Gibbs’ paradox is resolved once we renounce this belief. Grad continued, 

“Whether or not diffusion occurs when a barrier is removed depends not on a 
difference in physical properties of the two substances but on a decision that 
we are or are not interested in such a difference (which is what governs the 
choice of an entropy function). There is no paradox to any observer. When 
he is aware of a difference in properties, he observes diffusion together with 
an increase in entropy. When he is unaware of any difference, he observes no 
diffusion and no increase in the entropy which he is using. If two observers 
disagree, they must be interested in different phenomena, and there is no 
conflict.” 

To clarify his idea, Grad raised different situations that describe states of an n-particle 
gas. In the first situation, we label particles in the gas by 1, • • • ,n. In the second situation, 
we count the particle number of the gas in a certain region of space. These two situations 
are different and give rise to two different non-comparable entropies. In the first situation, 
when we remove the partition and mix the gas, the entropy S\ increases by rzln2. Then, 
when we reinsert the partition, the Si decreases by nln2 and returns to its original value. 
This is because we have complete information whether each particle is in the left or 
right side since we label all the particles. In the second situation, when we remove the 
partition, the entropy S 2 increases by n In 2, if we distinguish the particles originally in 
the two different sides when we count the number of particles. On the other hand, S 2 
does not increase, if we are not interested in any difference of the particles. In both cases, 
when we reinsert the partition, the entropy does not change. In this way, the entropies 
in these descriptions differ from each other. Although S\ is completely impractical from 
the viewpoint of thermodynamics, it does exist and give one description of the system. 

In summary, the notion of entropy depends on our interest or description of a system. 
If entropies differ from each other, it just implies a difference of our interest, and there 
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is no conflict. As to Gibbs’ paradox, the difference of the entropy production reflects our 
situation of whether or not we are interested in a difference of the particles. 


van Kampen (31 J 


In his essay, van Kampen discussed Gibbs’ paradox. The aim of his essay is to refute such 
statements as 

“It is not possible to understand classically why we must divide N\ to obtain 
the correct counting of states,” and “Classical statistics thus leads to a con¬ 
tradiction with experience even in the range in which quantum effects in the 
proper sense can be completely neglected.” 

First, he calculated the entropy S(P,T ) of an ideal IV-particle gas with pressure P 
and temperature T, and obtained 


S(P, T) = ^ Nk B In T - Nk B In P + C. 


( 6 . 10 ) 


Then, he emphasized, 

“the second law defines only entropy differences between states that can be 
connected by reversible change.” 

Therefore, the constant C is only required to be independent of P and T. Thus, at this 
point, there is no way to compare entropy with different N, unless we introduce a new 
reversible process that varies N. We consider a process in which we attach two boxes 
containing identical gases with the same ( P,T,N ), and open a channel between them. 
Then, the constant C should be proportional to N and we obtain 

S(P, T, N) = ^Nk P> In T - Nk B In P + cN, (6.11) 

where c does not depend on (P, T, N ). If two boxes contain two different gases A, B, the 
process in which the channel is opened is no longer reversible. Instead, we need to consider 
a process with semi-permeable walls. We combine two boxes using the semi-permeable 
walls in a reversible way, and then isothermally and quasi-statistically expand the box 
so that the pressure returns to its original value. Together with the convention of the 
additivity of the thermodynamic entropy, we obtain 


C, P N 

S(p , T, N a = N/ 2, N b = N/2) = -Nk B In T - Nk B In - + - (c A + c B ). (6.12) 


Even when we set A=B, Eq. (6.12) does not reduce to Eq. (6.11). This difference is what 
Gibbs’ paradox concerns. Then, van Kampen argued, 
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“The origin of the difference is that two different process had to be chosen for 
extending the definition of entropy. They are mutually exclusive: the first one 
cannot be used for two different gases and the second one does not apply to a 
single one.” 

Next, he considered the case in which A and B are so similar that an experimenter cannot 
distinguish them operationally, namely, he does not have the semi-permeable walls needed 
in the second process. Then, he will conclude 

S(P, T , N a = N/2, A'b = N/2) = ^Nk B In T - Nk B In P + cN. (6.13) 

This seems to be contradictory at a first glance, but 

“The point is, that this is perfectly justified and that he will not be led to 
any wrong results. If you tell him that ‘actually’ the entropy increased when 
he opened the channel he will answer that this is a useless statement since 
he cannot utilize the entropy increase for running a machine. The entropy 
increase is no more physical to him than the one that could be manufactured 
by taking a single gas and mentally tagging the molecules A or B. [...] The 
expression for the entropy (which he constructs by one or the other processes 
mentioned above) depends on whether or not he is able and willing to distin¬ 
guish between the molecules A and B. This is a paradox only for those who 
attach more physical reality to the entropy than is implied by its definition.” 

Van Kampen concluded 

“The question is not whether they are identical in the eye of God, but merely 
in the eye of the beholder.” 

Jaynes [32] 

Jaynes argued that the writing of Gibbs [29] contains a correct analysis of Gibbs’ paradox. 
However, this analysis has been lost due to ambiguity in the writing and the fact that 
Gibbs did not include it in his later renowned textbook [SS], Jaynes presented a “half 
direct quotation” of Gibbs’ explanation [32]. 

First of all, Jaynes stressed that we have to be circumspect about what we mean by 
the words “state” and “reversible.” He wrote 

“But by the word ‘original state’ we do not mean that every molecule has been 
returned to its original position, but only a state which is indistinguishable 
from the original one in the macroscopic properties that we are observing.” 
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Therefore, in the mixing of two different gases, the particles originally in the left (right) 
side must return to the left (right) when we say that the state returns to the original 
state. In contrast, in the mixing of two identical gases, we do not mean that the particles 
originally in the left (right) side should return to the left (right) when we say that the 
mixing is reversible and accompanied by no entropy production. We say that the mixing is 
reversible because all macroscopic properties (e.g. the chemical composition, the number 
of particles) return to their original value after we reinsert the partition. Thus, 

“Trying to interpret the phenomenon as a discontinuous change in the physical 
nature of the gases (i.e. in the behavior of their microstates) when they become 
exactly the same, misses the point. [...] We might put it thus: when the gases 
become exactly the same, the discontinuity is in what you and I mean by the 
words ‘restore’ and ‘reversible’.” 

To clarify this point, Jaynes continued his discussion. He noted that a thermodynamic 
state is defined by specifying a small number of macroscopic quantities {Xi,X 2 , • • • , X n }. 
The entropy is defined as a property of a macrostate specified by these quantities: S = 
S(Xi,X 2 , • • • , X n ). As indicated by the discussion of the gas mixing, the entropy is not a 
property of the microstate, whereas other thermodynamic variables as the total mass and 
the total energy are physically real properties of a microstate. In fact, the thermodynamic 
entropy is a property of a macrostate C(X), namely, a set of microstates compatible with 
X = {Ab, X 2 , • • • , X n }. Thus, it is possible to assign different entropies S, S' to the same 
microstate, if we choose different sets of macroscopic variables and embed the microstate 
in two different macrostates C, C'. This implies that we have to specify macroscopic vari¬ 
ables that we can measure and control in advance to define the thermodynamic entropy. 
Then, the thermodynamic entropy obeys the second law of thermodynamics as long as 
all experimental manipulations are within the set of macroscopic variables that we have 
chosen beforehand. Because this choice connotes whether we regard the gases as different 
or identical, the behavior of the entropy under the gas mixing hinges upon this choice. 

Summary 

As we have seen, the thermodynamic entropy is not an intrinsic property of a microstate, 
and the definition of the thermodynamic entropy involves arbitrariness. Whether the 
gases are identical or different is predetermined within our thermodynamic framework 
that we have chosen beforehand. Thus, it is meaningless to discuss the discontinuity in 
the thermodynamic entropy when we consider the infinitely similar gases. In this sense, 
Gibbs’ paradox is not a paradox. 
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Now that we qualitatively understand that Gibbs’ paradox is not a paradox even within 
classical thermodynamics, the remaining question is how to derive the factorial in the sta¬ 
tistical definition of the thermodynamic entropy, namely, the factor N\ in the definition 
of the entropy 


S 



(6.14) 


where W denotes the number of microstates, or in the definition of the partition function 

z = ^ J e- mr) dT, (6.15) 


where H(T) is the Hamiltonian of the system. In the following two sections, we review 
two well-known quantitative resolutions of Gibbs’ paradox. 


6.1.3 Quantum resolution 

The standard resolution of Gibbs’ paradox is based on quantum mechanics. In quantum 
theory, the interchange of identical particles does not lead to another state, and identical 
particles are indistinguishable in principle. This indistinguishability naturally leads to 
the factor N\. Many textbooks (for example, see Refs. [69H73] ) resolve Gibbs’ paradox in 
this way. 

Following Ref. [65], we consider an A'-particlc quantum system with a Hamiltonian 

^ = Ei^ + y ( x !>••• (6.16) 

i 

where m is the mass of the particles. Let \i) and E f denote the i-th eigenstate and eigenen- 
ergy, respectively. Then, the (unnormalized) density matrix of the canonical ensemble is 

pp = !*)(*!'■ ( 6 - 17 ) 

i 

The configurational representation reads 

P/3(xi, • • ■ , xjv; xi, • ■ ■ , x)v) = ^ e _/3Ei ^(xi, • • • , xjv)^*(xi, • • • , x^), (6.18) 

i 

where ^(xi,-- - ,xjv) = (xi, • • • ,xjv|*)- When the particles are bosons, this density 
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matrix must be symmetrized as 


p s n 


Xi,-” ,Xjv;xi,-- - j Xjv) = P/ 3 ( X 1 ^ • • • !),••• ; x'a(JV )) 5 ( 6 - 19 ) 


ctS5jv 


where Sjv is the symmetry group of degree n. The partition function is given by 


^■syn 


J dx 1 • • • dxAr P^ ym (xi, • • • , Xjv; x 1? • ■ ■ , xy) 

^ / dxi • • • dxjv , xjv; x (t( i ) , • • • , x CT (jv)) (6.20) 

' o-esv ^ 


The integrand in this equation involves factors 

m(xi - Xa(i)) 2 


exp 


2[3h 2 


( 6 . 21 ) 


In the classical (i.e., high-temperature) limit, due to this exponential decay, the dominant 
contribution in Eq. (6.20) is the term in which a is the identity permutation. Therefore, 


Eq. (6.20) is approximated as 

Zf m - ^ I dx i • ■ • dx N pp(x i, • • • , xjv; xi, • • • , x w ). 


( 6 . 22 ) 


In the case of fermions, a similar discussion leads to the same conclusion. In this way, the 
factor N\ can naturally be derived from quantum mechanics and this factor leads to the 
extensive thermodynamic entropy. 

Although this resolution is the standard resolution of Gibbs’ paradox, it involves two 
crucial problems. First, this resolution cannot apply to mesoscopic particles. Let us 
consider colloidal particles in liquid. Because colloidal particles have vast internal degrees 
of freedom, we cannot expect that these particles have the same internal states. Therefore, 
the wave function of this system should not be symmetrized. Hence, we cannot derive the 
factor N\, and thermodynamic quantities (e.g. entropy) fail to be extensive. This implies 
that the quantum resolution is inapplicable to a mesoscopic regime. The second point is 
more fundamental and to be explained at the end of the next section, because this point 
is closely linked to the topic described in the next section. 


6.1.4 Pauli’s resolution based on the extensivity 

As we see in the review of van Kampen’s work [31], the Clausius definition of thermo¬ 
dynamic entropy reveals nothing about the dependence of the entropy on the particle 
number. Pauli recognized this fact and gave a resolution within classical theory [331, 32j. 


110 







Pauli’s analysis is based on the extensivity of the thermodynamic entropy. 

Let S(T, V, N ) be a phenomenological thermodynamic entropy of an ideal iV-particle 


gas with temperature T and volume V defined by the Clausius definition (6.1). Then, 
from the definition, we obtain 


S(T, V , N) = -Nk B In T + Nk B In V + k B f(N), 


(6.23) 


where f(N) is not an arbitrary constant, but an arbitrary function of N. This is because 
the Clausius definition does not involve the 7V-dependence of the thermodynamic entropy. 
To determine f(N), we require the extensivity of the entropy as an additional condition. 
The extensivity means 


S{T,qV,qN) = qS{T,V,N), 


(6.24) 


where q is an arbitrary positive real number. Substituting Eq. (6.23) into Eq. (6.24), we 
obtain 


qNlnq + f(qN)=qf(N). 


(6.25) 


Differentiating with respect to q and setting q — 1, we obtain 

N + Nf'{N) = f(N). 

This equation can be rewritten as 

d (f(N)\ _ 1 

dN \ N J N' 

Therefore, we obtain 


/ (N) = N f (1) — N In N, 


(6.26) 


(6.27) 


(6.28) 


where the second term is the factor N\ due to the Stirling formula in the large- N limit, 
i.e., in the thermodynamic limit. Thus, the thermodynamic entropy is 

S(T,V,N) = ^Nk B \nT + Nk B ln^ + Nk B f(l). (6.29) 

By this form, we see that the entropy is extensive and /(1) is essentially the chemical 
potential. In this way, the requirement of the extensivity leads to the factor IV! in the 
phenomenological entropy defined by the Clausius in the thermodynamic limit. 
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The same argument applies to the entropy defined by classical statistical mechanics. 
The reason why we identify the entropy defined by classical statistical mechanics as the 
thermodynamic entropy is that these two entropies have the same response to any vari¬ 
ation of such macroscopic variables as the temperature and volume. In mathematical 
terms, these two entropies have the identical differential form. Therefore, what we can 
conclude about relations between the thermodynamic entropy S(T, V, N ) and the entropy 
in classical statistical mechanics S C (T,V, N) is 

S(T, V, N ) = S C (T, V, N) + k B f c (N). (6.30) 


For an IV-particle ideal gas, we have 


S C {T,V,N) = -JVfc B [ln(2!rmfc B T) + 1] + Nk B In V - 3Nk B ln£, 


(6.31) 


where £ is a constant of the dimension of action. Then, the requirement of the extensiv- 


ity (6.24) leads to 


f c (N) = Nf c (l) - NlnN. (6.32) 

The extensivity reproduces the second term on the right-hand side, i.e., the factor N\ 
again. 

Quantum statistical mechanics is in the the same position as classical statistical me¬ 
chanics. We have 


S(T, V, N ) = S q {T, V, N ) + k B f Q (N). (6.33) 

In the classical limit, since we have 

S q (T, V, N) ~ S C {T, V, N ) - In IV!, (6.34) 

the requirement of the extensivity leads to 

/ q (A 0 = /V/«(l) + In (6.35) 

where the second term on the right-hand side vanishes in the thermodynamic limit. Hence, 
f®(N) has only a trivial dependence on N as 

/Q(7V)~AT/Q(l), (6.36) 
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which corresponds to the contribution from the chemical potential. In this manner, the 
procedure to determine an arbitrary function of N is needed for quantum statistical me¬ 
chanics as well, although the result is simpler than that of classical statistical mechanics. 
Therefore, [31] 

“the Gibbs paradox is no different in quantum mechanics, it is only less man¬ 
ifest.” 

As we have seen, the quantum resolution is irrelevant to Gibbs’ paradox. The reso¬ 
lution based on the extensivity is a better and logical resolution, and applicable to the 
phenomenological entropy, the entropy in classical statistical mechanics and the entropy 
in quantum statistical mechanics. However, this resolution still suffers a problem. The 
resolution is only applicable to systems in the thermodynamic limit in which we are en¬ 
titled to require the extensivity of the entropy. Namely, it ignores deviations from the 
extensivity, which are essential to deal with mesoscopic physics and surface effects. 

6.2 Resolution from absolute irreversibility 

In this section, we resolve Gibbs’ paradox based on the nonequilibrium equalities with 
absolute irreversibility. We explain why the entropy productions of the two mixing pro¬ 
cesses are different from each other, and then derive the factor N\. Our resolution is valid 
even for non-extensive entropy, for which the resolution given by Pauli breaks down. 

6.2.1 Requirement and Results 

In the resolution by Pauli, we require the extensivity of the thermodynamic entropy. 
Instead of this requirement, we require the additivity of the thermodynamic entropy. In 
mesoscopic systems, the extensivity breaks down, whereas the additivity remains valid 
as long as the interaction is short-range. The additivity plays a crucial role when we 
compare the thermodynamic entropy with different N. 

Under the requirement of the additivity, in the context of our nonequilibrium equality, 
we show that the entropy production of the mixing of two identical V-particle gases is 

AS ld = 0 (6.37) 

and that the entropy production of the mixing of two different iV-particle gases is 

A5 dif = 2Nk B In 2 (6.38) 
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in the thermodynamic limit. This difference of the two processes will be explained in terms 
of absolute irreversibility. Moreover, we will derive the factor N\, that is, we show that 
the arbitrary function f c (N) in classical statistical mechanics should take the following 
form: 


f c (N) = Nf c (l)-lnN\. 


(6.39) 


This result is valid for a finite N without the thermodynamic limit. 


6.2.2 Difference of the two processes 

We consider a difference between the mixing of two identical gases and that of two dif¬ 
ferent gases in terms of absolute irreversibility. First, we consider the reverse process 


of the mixing of two identical gases illustrated in Fig. 6.2 (a) to evaluate the absolute 
irreversibility. Initially, an ideal 2IV-particle gas is in the equilibrium of the entire box. 
Then, we insert the partition in the middle. Let n denote the number of the particles 
found in the left side after the insertion. The event of n = N is the only event that has the 


corresponding event in the original process (see also Fig. 6.1 (b)). Therefore, the events 


oin^N are singular because they have no counterparts. Hence, the singular probability 
is calculated as 


*s = 1 - 


2nCn 

2 2N 


(6.40) 


Secondly, we consider the reverse process of the mixing of two different gases illustrated 


in Fig. 6.2 (b). Initially, two ideal IV-particle gases of different kinds are at thermal 
equilibrium in the entire box. Then, we insert the partition in the middle. To recover the 
original state, the particle number in the left side after the insertion must be N. Moreover, 
the chemical composition must return to the original state. Namely, the particles from 
the left (right) side in the original process must return to the left (right) side. This fact 
makes sharp contrast to the case of two identical gases, in which the particles from the 
left (right) may go to the right (left) sides as long as the particle number returns to N. 
The only non-singular event is the one in which all the particles of one kind return to the 
left side and the rest particles return to the right side. All the other events indicated by 
blue arrows are singular. Thus, the singular probability is 


\ dif _ i 

A S 


(6.41) 
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Figure 6.2: (a) Reverse process of the mixing of two identical gases. Initially, an ideal 

2iV-particle gas is at thermal equilibrium in the entire box. Then, we insert the partition 
in the middle. The particle number in the left side n varies from 0 to 2 N according to 
the binomial distribution. The events of n ^ N indicated by the blue arrows are singular 
because they have no counterparts in the original (i.e., forward) process, (b) Reverse 
process of the mixing of two different gases. Initially, two ideal iV-particlc gases of different 
kinds are at thermal equilibrium of the entire box. Then, we insert the partition in the 
middle. To return to the original state, not only must the particle number in the left side 
be N, but also the composition of gas must be the same as the original state. Therefore, 
the mixing of two different gases has much more absolute irreversibility than the mixing 
of two identical gases. 
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In this way, the difference of the intuitive physical descriptions in the reversed processes 
is quantitatively characterized by the difference of the probabilities of the absolutely 
irreversible paths. 

Next, we connect these singular probabilities to the thermodynamic entropy produc¬ 


tion. The Jarzynski-type nonequilibrium equality (4.33) in the presence of absolute irre¬ 
versibility reads 


(g-^-AF)) = i _ As . 


(6.42) 


Since work is zero (W = 0) in the mixing process, we obtain 


A F = k B T ln(l - Ac 


(6.43) 


Thus, in terms of the thermodynamic entropy, we obtain 


AS = —k B ln(l - A s ). 


(6.44) 


Substituting Eq. (6.40) into Eq. (6.44), we obtain 


A S id = 2 Nk n In 2 - kn In , N C 


B m 2N'~-'N- 


(6.45) 


When N is sufficiently large, the combination in Eq. (6.45) is approximated as 

o27V 

C N ~ 


27V O'AT 


V^n' 


Therefore, Eq. (6.45) reduces to 


A S ld ~ -kn In 7 tN. 


(6.46) 


(6.47) 


Since this value is sub-extensive, we have 


AS id = 0 


(6.48) 


in the thermodynamic limit. This is consistent with the fact that the removal of the 
partition becomes reversible in the large -N limit due to the law of large numbers, namely, 
the reinsertion of the partition results in the state of n = N with almost unit probability. 


In the case of two different gases, from Eqs. (6.41) and (6.44), we obtain 


AS dii = 2 Nkn In 2. 


(6.49) 
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Figure 6.3: (a) Mixing of an IV-particle gas and an M-particle gas of the same kind. A 

box is divided into equal halves by a partition. Initially, the IV-particle gas is in the left 
side, and the M-particle gas is in the right side. We remove the partition and the gases 
expand to the entire box. (b) The reverse process. Initially, an (N + M)-particle gas is at 
thermal equilibrium in the entire box. Then, we insert the partition. The number of the 
particles in the left side n varies from 0 to N + M according to the binomial distribution. 


The difference of these entropy productions of the two processes originates from the dif¬ 
ference of the degree of absolute irreversibility, namely, the difference in the behaviors 
under the reinsertion of the partition. 


6.2.3 Derivation of the factor N\ 


To determine the arbitrary function / C (1V) in Eq. (6.30), we consider a mixing process 


of an IV-particle gas and an M-particle gas of the same kind illustrated in Fig. 6.3 (a). 
To evaluate the probability of absolute irreversibility, we consider the reverse process (see 


Fig. 6.3 (b)). We insert the partition in the middle in an (N + M)-particle gas. The 


number of particles in the left side, n, after the reinsertion must be N to recover the 
original state. This is the only non-singular event. Therefore, the singular probability is 


Ac — 1 — 


n+mCn 

2N+m 


(6.50) 


Thus, from Eq. (6.44), the thermodynamic entropy production is 


A S — (A T M^)k B In 2 — k g In n+mCn- 


(6.51) 


On the other hand, from Eq. (6.31) and the additivity of the thermodynamic entropy, we 
obtain 


AS = (N + M)k B In 2 + k B f c (N + M) - k B f c (N ) - k B f c (M) (6.52) 
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Therefore, we obtain 


f C (N + M) - f c (N) - f c (M) = - In N+M C N . (6.53) 


When set M — 1, we have 


f C (N + 1) - f c (N) - / c ( 1) = - In (N + 1). 


(6.54) 


Let us define 


9 c (iV)=exp[/ c (Af)], 


and then Eq. (6.54) reduces to 


,C / A 7- , ^ „Cf AT\ fi ,C (^-) 


g^(N+l)=g^N) 


N + 1 


This equation can be rewritten as 


(N + 1)! • g c (N + 1) = g c ( 1) • N\ ■ g c (N). 


(6.55) 


(6.56) 


(6.57) 


Therefore, we obtain 


N\ ■ g c (N) = {g c (l)} N (6.58) 

and 

( 6 - 59 ) 

ffence, we conclude 

f c (N) = Nf c (l) - In N\. (6.60) 

Thus, the desired factor N\ is reproduced. 

In summary, based on our nonequilibrium equality that is applicable in the presence 
of absolute irreversibility, we have shown that the difference of the entropy productions 
in the two mixing processes originates from the difference of the degree of absolute irre¬ 
versibility. Furthermore, we have reproduced the factor N\ in the relation between the 
thermodynamic entropy and the classical statistical mechanical entropy. Our new reso¬ 
lution automatically takes account of the sub-leading term and mesoscopic effects in the 
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thermodynamic entropy, which was ignored in the resolution based on the extensivity of 
the thermodynamic entropy. 
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Chapter 7 

Conclusions and Future Prospects 


7.1 Conclusions 

In this thesis, we have investigated the situations to which the conventional integral 
nonequilibrium equalities cannot apply, and proposed a new concept of absolute irre¬ 
versibility to describe these situations in a unified manner. In absolutely irreversible pro¬ 
cesses, some of time-reversed paths have no counterpart in the original forward process, 
and the entropy production diverges in the context of the detailed fluctuation theorems. 
In mathematical terms, absolute irreversibility is defined as the singular part of the time- 
reversed probability measure with respect to the forward probability measure. Lebesgue’s 
decomposition enables us to separate the absolutely irreversible part from the ordinar¬ 
ily irreversible part. As a consequence, we have obtained the integral nonequilibrium 
equalities in the presence of absolute irreversibility. The obtained equalities involve two 
physical quantities related to irreversibility: the entropy production representing ordinary 
irreversibility and the singular probability describing absolute irreversibility. The corre¬ 
sponding inequalities give tighter fundamental restrictions on the entropy production in 
nonequilibrium processes than the conventional second-law like inequalities. Our nonequi¬ 
librium equalities have been verified in free expansion and in numerical simulations of the 
two Langevin systems. 

Moreover, we have generalized our nonequilibrium equalities in absolutely irreversible 
processes to the situations in which the system is subject to measurement-based feedback 
control. We have transformed the obtained nonequilibrium equalities and introduced 
a concept of unavailable information, which characterizes the inevitable inefficiency of 
feedback protocols. As a result, we have derived inequalities that give an achievable 
lower bound of the entropy production. We have verified our information-thermodynamic 
absolutely irreversible nonequilibrium equalities in the process with a measurement and 
trivial feedback control and in the two- and multi-particle Szilard engines. 
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We have applied the notion of absolute irreversibility to the gas-mixing problem of 
Gibbs’ paradox. The difference between the entropy production of the mixing process of 
two different gases and that of two identical gases originates from the difference of absolute 
irreversibility, i.e., different behaviors under the reinsertion of the partition. Moreover, we 
have reproduced the factorial in the particle-number dependence of the thermodynamic 
entropy. Our quantitative resolution of Gibbs’ paradox applies to a classical mesoscopic 
regime, where Pauli’s resolution based on the extensivity of the thermodynamic entropy 
breaks down. 


7.2 Future prospects 

As future prospects, I enumerate several outstanding issues. 

First of all, I intend to generalize our absolutely irreversible integral nonequilibrium 
equalities to the quantum regime. A part of this extension has already been done, and 
we have shown that absolute irreversibility is essential under inefficient feedback control 
and projective measurements m However, nonequilibrium equalities under the condition 
that the initial state has such quantum correlations as entanglement are elusive. Absolute 
irreversibility may play an important role in these quantum nonequilibrium situations. 

Next, I intend to apply our formulation based on measure theory to the chaotic sys¬ 
tems. Although the nonequilibrium equalities were first proven in chaotic systems, most 
of recent researches of nonequilibrium equalities are restricted to such simple systems as 
the Langevin systems. I expect our formulation is compatible with the chaotic system 
that has a singular continuous probability measure with respect to the Lebesgue mea¬ 
sure. This issue may be related to a topic of thermalization because the relaxed state 
after a nonequilibrium process starting from the canonical distribution sometimes exhibits 
singular behaviors. 

Finally, I contemplate applying our nonequilibrium equality to finite-time thermody¬ 
namics. Finite-time thermodynamics is a field of thermodynamics that puts emphasis on 
power of thermodynamic engines and seeks to their optimal efficiency. Therefore, thermo¬ 
dynamic engines under consideration are subject to finite-time nonequilibrium processes. 
Recently, in the context of nonequilibrium equalities, the efficiency of finite-time engines 
has been studied ra- i expect that our nonequilibrium equalities with absolute irre¬ 
versibility give new restrictions on the efficiency. 


121 


Appendix A 

From the Langevin Dynamics to 
Other Formulations 


In this Appendix, we derive the path integral formula (2.39) and the Fokker-Planck equa¬ 


tion (2.41) from the overdamped Langevin equation 


x{t) = /iF(x(t),X(t)) + COO, 


(A.l) 


where ((t) is a white Gaussian noise satisfying 


(m) = o, 

(C(t)C(O) = 2 D5(t-t'). 


(A.2) 

(A.3) 


A.l Path-integral formula 


To calculate the path probability, we first discretize the time interval [0, r] into N sections 
with the same length At = r/N. We define t t = iAt (i — 0,1, • • • , N), ay = x(ti), and 
A i = Then, the discretized Langevin equation read^j] 


F(x i ,X i ) + F(x i+1 ,X i+1 ) 

x i+1 - Xi = /i--- At + AHy + i, 


(A.4) 


where 


AIW+, = 



1 We use the Stratonovich convention. 


(A.5) 
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for % = 0, • • • ,N — 1. The discretized noise A Wi obeys the Gaussian distribution with the 
average 


(A Wi) = / (C (t))dt = 0 


(A.6) 


and the variance 


(Abbj 2 ) = 


a ti 

j-1 

a U 

j-1 




2D5(t — t')dtdt' 


2 Ddt 


= 2D At. 


(A.7) 


Therefore, the probability distribution of AW t is given by 


1 T Abb 2 

P(AWj) = — =- exp --A1L. 

x/AnDAt AD At 


(A.8) 


Thus, the joint probability distribution of all A W* is calculated as 


1 i iV 

V[{AW}] = . exp V A W\ 

(VAirDAt) n AD At 


/47^DA^) , 


exp ^ f xi+i—Xi ^F( x i, \i) + F(xi + i, \i+i)\ ^ 


(A.9) 


In the large-A limit, we obtain 


V[{AW}}=Uex p J (x(t) - fiF(x(t),\(t))) dt , 


(A. 10) 


where J\f is the normalization constant. 

The path probability P[{a:}|a;o] is defined in terms of the probability distribution 
P[{AW}} as 


VftxyixolYldxi = V[{AW}} \\dAW t . 


(A.ll) 
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To obtain the path probability, we calculate the Jacobian as 


„ , / dAWi \ 

J = det (isr) 


= det (Sij (l - | d x F(xi , A,) At) - 8 ^ (l + ^d x F(x^i, Aj_i)At)) 


N 


II (l - ^ d x F(xi, Xi)At 

i— 1 


N 


i= 1 


= exp 


P[exp -^d x F(xi,\i)At 

N 

^ | d x F(x i: Xi)At 

i= 1 

Thus, in the continuous limit (At —» 0), we obtain 

J = exp 

and therefore 


(A. 12) 


^d x F(x(t),\(t))dt 


(A. 13) 


V[{x}\x„] = V[{AW}]J 
= Af exp 


■At) - mm m? + m F{x{tl m | dt 


4 D 


(A. 14) 


which is nothing but the path integral formula for the overdamped Langevin equation 


(2.39). 


A.2 Fokker-Planck equation 

First of all, we evaluate the time evolution of an arbitrary function f(x(t)) as 

df(x(t)) = f(x(t + dt)) - f(x(t)) 

= f'(x(t))dx + -f" (x(t))dx 2 + 0((dx) 3 ), 


(A.15) 


where 


/ t-\-dt 

((t')dt' + 0((dt) 2 ). (A.16) 
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We take the statistical average of Eq. (A. 15). Substituting Eq. (A.16) into the average of 


the first term on the right-hand side of Eq. (A.15), we obtain 


f't+dt 


(f(x(t))dx) = (fiF(x(t), X(t))f(x(t)))dt + / (f{x{t))((t'))dt' + 0((dt) 2 ). (A.17) 


Since the stochastic quantity f(x(t )) at time t is independent of the noise £(£') for t' > t, 
we obtain 


{/'W«)K(q> = </'(^W)>(C(O> = 0 


(A. 18) 


Therefore, Eq. (A.17) reduces to 


{.f\x(t))dx) = A(f ))}\x(t)))dt + 0((dtf 


(A. 19) 


In a similar way, the average of the second term on the right-hand side of Eq. (A.15) 
reduces to 


^{f"(x(t))dx 2 ) = ^(n 2 F(x{t),\(t)) 2 f"(x(t)))dt 2 + 


pt+dt 


(fiF(x(t), X(t))f'\x(t))((t'))dt 


t*t~\~dt rt+dt 

Jt Jt 

rt-\-dt rt-\-dt 


(amnnxmdt'dt" + 


= D J J 5(t' -t"){f"{x{t)))dt'dt" + 0((dt) 2 ) 
= D(f"(x(t)))dt + 0((dt) 2 ). 


(A.20) 


Therefore, the average of Eq. (A.15) is 


X(t))f'(x(t)))dt + D(f"(x(t)))dt + 0((dt) 2 ). (A.21) 


To derive the Fokker-Planck equation, we note that the probability p(x, t) to hnd the 
Langevin particle at position x at time t is given by 


p(x,t) = (S(x-x(t))). 


(A.22) 


If we set f(x(t)) = 5(x — x(t)), Eq. (A.21) reduces to 


dp(x, t ) = —(pF(x(t), X(t))5'(x — x(t)))dt + D(S"(x — x(t)))dt + 0((dt) 2 ) 

= —d x (pF(x(t), X(t))5(x — x(t)))dt + Dd 2 {8(x — x(t)))dt + 0((dt) 2 ) 

= —d x [p,F(x, X(t))p(x,t)]dt + Ddlp(x,t)dt + 0((dt) 2 ). (A.23) 
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Thus, we obtain the following Fokker-Planck equation 


d t p(x, t ) = 


-d x [(nF(x, X(t)) - Dd x )p(x,t)]. 


(A.24) 
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Appendix B 

Measure Theory and Lebesgue’s 
Decomposition 


In this Appendix, we briefly review measure theory and Lebesgue’s decomposition theo¬ 
rem. This Appendix is mainly based on [26] . 


B.l Preliminary subjects 


Definition 1 (cr-algebra) A family A of subsets of X is said to be a a-algebra if the 
following three conditions are met: 

(i) 0 and A" belong to X ; 

(ii) If A belongs to A", then X\A belongs to A; 

(iii) If (A n ) is a sequence of sets in X, then lj^=i A n belongs to X. 

Definition 2 (measurable space) An ordered pair (A", A) consisting of a set X and a 
a-algebra X of subsets of X is called a measurable space. 


Definition 3 (measure) A measure is an extended real-valued function /j defined on 
a cr-algebra X of subsets of A" satisfying the following conditions: 

(i) p,(0) = 0; 

(ii) n(E) > 0 for all E G A; 

(iii) fi is countably additive, i.e., if ( E n ) is any disjoint sequence of sets in A, then 


h 



— i^{E n ). 

71=1 


(B.l) 
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Definition 4 (measure space) A measure space is an ordered triad (X,X,n) con¬ 
sisting of a set X, a a -algebra X of subsets of X and a measure // defined on X. 

Definition 5 (((x-)finite measure) Let (X,X,/j,) be a measure space. If // does not 
take on an infinite value, we say that // is finite. If there exists a sequence ( E n ) of sets 
in X with X = 1J^L 1 E n and fJ>(E n ) < oo for all n, then we say (i is tr-finite. 

Definition 6 (mesurable function) An M-valued function / with domain X is said to 
be A-measurable if for any real number a the set 

{xeX\f(x)>a} (B.2) 


belongs to X. 

Definition 7 (almost everywhere) Let p be a measure on X. A certain proposition is 
said to hold //-almost everywhere on A" if there exists a subset N E X with = 0 
such that the proposition holds on X\N. 

Theorem 1 Suppose that / is a nonnegative A-measurable function. Then, f[x) = 0 
//-almost everywhere on X iff 

f fdji = 0. (B.3) 


B.2 Classification of measures 


Definition 8 (absolutely continuous) A measure v on X is said to be absolutely 
continuous with respect to a measure // on X if E e X and n(E) = 0 imply v(E) = 0. 
In this case, we write v //. 


Lemma 1 Let // and v be finite measures on X . Then v <C // iff for every e > 0 there 
exists a S > 0 such that E e X and //(A) < 6 imply that v(E) < e. 

Proof. If this condition is satisfied and //(A) = 0, then v{E) < e for all e > 0, which 
implies v(E) = 0. 

Conversely, suppose that there exist some e > 0 and E n e X with n(E n ) < 2~ n and 
v{E n ) > e. Let F n = \J£ =n E k , so that n(F n ) < 2~ n+l and v{F n ) > e. Since (F n ) is a 
decreasing sequence of measurable sets and //, v are finite measures, we have 





lim n(F n ) 

n—>oo 



lim u(F n ) > e. 

n—>oo 


(B.4) 
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Therefore, v is not absolutely continuous with respect to \i. 


□ 


Intuitively, v <C fi means that a set that has a small /^-measure also has a small 
//-measure. 

Definition 9 (singular) Two measures /j and v on X are said to be mutually singular 

if there are sets A and B G X that satisfy X = A U B, 0 = A fl B and fi(A) = v{B) = 0. 
In this case, we write /ill/. 

Despite this symmetric definition, we also say that v is singular with respect to ji. 


Lemma 2 Let a be a measure such that a <C fi and a _L n, then a = 0. 

Proof. Since a T /i, there exist sets A and B such that 

X = AUB, 0 = A fl B, a (A) = 0, ju(B) = 0. (B.5) 

Since a <C fi and fJ>(B) = 0, a(B) = 0. Then, due to the additivity of a, we have 

a(X) = a(A) + a(B) = 0. (B.6) 

It follows that for all E e X 

0 < a(E) = a(X) - a(X\E ) < 0, (B.7) 

which implies a = 0. □ 


Definition 10 (discontinuous point) For u G X , if ^({ca}) > 0, then ca is said to be 
a discontinuous point of fl. 

Definition 11 (discrete measure) Let C denote the set of all the discontinuous points 
of ii. Then, // is said to be discrete measure if /x( X) = 

Definition 12 (continuous measure) A measure // is said to be a continuous mea¬ 
sure if fi has no discontinuous points. 

B.3 Radon-Nikodym theorem 

Theorem 2 (Radon-Nikodym theorem) Let /i and v be cr-finite measures defined on 
X and suppose that v is absolutely continuous with respect to //. Then, there exists a 
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nonnegative A’-measurable function / such that 

u(E) = [ fdg, y E G T. (B.8) 

J E 

Moreover, the function / is uniquely determined //-almost everywhere. 

The function / is referred to as the Radon-Nikodym derivative, and formally written 
as 


du 

dg 


(B-9) 


Moreover, / is the transformation function from u to g. In fact, for an arbitrary X- 
measurable function g, we have 



du 

g—d/i 
: dp 



W E G X. 


(B.10) 


B.4 Lebesgue’s decomposition theorem 

Theorem 3 (Lebesgue’s decomposition theorem 1) Let g and u be u-hnite mea¬ 
sures defined on a u-algebra X. Then there exist measures z/ac and u$ such that u = 
z/ac + Vs, u ac p and u s _L g. Moreover, the measures z/ac and z/g are unique. 

Proof. Let A = g + u. Then, g and u are absolutely continuous with respect to A. 
Therefore, we can apply the Radon-Nikodym theorem to obtain 

g(E) = [ fdX, u(E) = [ gdX (B.ll) 

J E J E 

for all E G X, where /, g are nonnegative T-measurable functions. Let A = {x\f(x) > 0} 
and B = {x\f(x) = 0} so that A n B = 0 and X = A U B. 

Define z/ac and u s for E G X by 

u\c(E) — u{E D A), us(E) = u(E fl B). (B.12) 

Since u is additive, u(E) = u\q(E) + ug(E). To see z/ac g, we note that if g(E) = 0, 
then 

[ fdX = 0. (B.13) 

J E 


130 



In accordance with Theorem[lJ fix) = 0 for A-almost all x G E, which means \{EP\A) = 0. 
Since v <C A, v{E D A) = 0, and then vac(E) = 0. Thus, uac is absolutely continuous 
with respect to /i. On the other hand, since (A) = /i(I?) = 0, is singular with respect 
to /i. 

The uniqueness of this decomposition can be established by Lemma [2j □ 


Lemma 3 Let v be a measure defined on a a-algebra X. Then there exist measures z/ c 
and u d such that v = u c + z/ d , where u c is continuous and v d is discrete. Moreover, the 
measures is c and u d are unique. 

Proof. Let C be the set of all the discontinuous points of u. Define u c and u d for E e X 
by 


u c {E)=u{E\C), u d {E) = u{Er\C). 

Then, we can show that v c is continuous and u d is discrete. 
Suppose 


(B.14) 


v c + = v' c + (B.15) 

where v' c is continuous and u' d is discrete. If u d ^ u' d , there exists a single point oj e X such 
that z/ d ({w}) ~f~ ^({a;}). On the other hand, due to the continuity, u c ({c j}) = = 0. 

These relations lead to 


z/ c (M) + z/ d ({o;}) ^ i/ c ({u}) + ^ d ({^}), 


(B.16) 


which contradicts Eq. (B.15). Hence, u d = u d and therefore the uniqueness of the decom¬ 
position is established. □ 


To introduce a stronger version of Lebesgue’s decomposition, we prove the following 
lemma. 

Lemma 4 Let p and v be measures on a cr-algebra. If ji is continuous and v is discrete, 
then v is singular with respect to /i. 

Proof. Let C denote the set of all the discontinuous points of u. Then, u(X\C) = 0. On 
the other hand, since // is continuous and C is countable, p(C') = 0. Thus, u and /i are 
mutually singular. □ 
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Theorem 4 (Lebesgue’s decomposition theorem 2) Let // and v be a-finite mea¬ 
sures defined on a cr-algebra X and suppose fi is continuous. Then there exist measures 
z/ ac , ^sc an d vd such that u = u ac + u sc + rq, where z/ ac <C /i; -L /i and ^ sc is continuous; 
z/ d is discrete. Moreover, the measures /y ac , i/ sc and z/ d are unique. 

Proof. We can apply Theorem [3] to uniquely decompose 

v = ^ac + W (B.17) 

where z/ ac is absolutely continuous with respect to /i, and z/ s is singular with respect to /i. 
In accordance with Lemma |3j we can uniquely decompose v s into two parts: 

v & u sc T u d , (B.18) 

where u sc is continuous and z/ d is discrete. Thus, we obtain the following decomposition: 

Z' = ^ac + ^sc + Z'd- (B.19) 

The uniqueness of this decomposition follows from Lemma |4} □ 

The continuity of n is needed to establish the uniqueness of the decomposition. If /i 
has a discontinuous point uj G X, then the measure v w that satisfies u u ({u}) = (X ) is 
absolutely continuous with respect to /j and discrete at the same time. 


132 


Acknowledgement 


The studies in this thesis was done when the author was a master-course student in 
Masahito Ueda group in the University of Tokyo. This thesis would not be completed 
without help of a lot of collaborators and colleagues. 

First of all, I would like to express my best gratitude to my supervisor, Prof. Masahito 
Ueda. He made me cognizant of the topic in this thesis and gave me a lot of insightful 
comments throughout discussions. He also read the manuscript with great attention and 
gave me enormous suggestions. 1 would also thank him for the best research environment 
that he arranged for me. 

1 would also like to thank my collaborator, Ken Funo, for fruitful discussions on our 
studies and for teaching me quantum aspects of nonequilibrium equalities. 

1 am thankful for my collaborator, Yuto Ashida, for his constructive ideas from his 
deep comprehension of our field. 

1 am grateful to Prof. Takahiro Sagawa for his critical comments and beneficial dis¬ 
cussions on our work. 

1 appreciate critical comments and constructive suggestions by Prof. Shin-ichi Sasa. 

1 acknowledge comments from anonymous referees of our article izq, which I find very 
useful to clarify physical meanings of our work. 

1 would like to express my deep sense of gratitude to the members in Masahito Ueda 
group, especially to Yui Kuramochi and Tomohiro Shitara for fruitful discussions on math¬ 
ematical aspects of our work, and to Tatsuhiko N. Ikeda for suggestive comments. 

1 am also indebted to the members in Masaki Sano group in the University of Tokyo, 
especially to Kyogo Kawaguchi for lecturing me on theoretical aspects of mesoscopic 
physics and to Yohci Nakayama, Yuta Hirayama and Daiki Nishiguchi for teaching me 
experimental techniques of this field. 

Finally, I acknowledge financial support from the Japan Society for the Promotion of 
Science (JSPS) through the Program for Leading Graduate Schools (MERIT). 


133 


Bibliography 


[1] D. J. Evans, E. G. D. Cohen, and G. P. Morriss, Phys. Rev. Lett. 71, 2401 (1993). 

[2] G. Gallavotti and E. G. D. Cohen, Phys. Rev. Lett. 74, 2694 (1995). 

[3] J. Kurchan, J. Phys. A: Math. Gen. 31, 3719 (1998). 

[4] J. L. Lebowitz and H. Spohn, J. Stat. Phys. 95, 333 (1999). 

[5] D. J. Searles and D. J. Evans, J. Chem. Phys. 113, 3503 (2000). 

[6] D. J. Evans, D. J. Searles, and E. Mittag, Phys. Rev. E 63, 051105 (2001). 

[7] G. Gallavotti, Phys. Rev. Lett. 77, 4334 (1996). 

[8] C. Jarzynski, Phys. Rev. Lett. 78, 2690 (1997). 

[9] C. Jarzynski, Phys. Rev. E 56, 5018 (1997). 

[10] G. E. Crooks, Phys. Rev. E 60, 2721 (1999). 

[11] G. E. Crooks, Phys. Rev. E 61, 2361 (2000). 

[12] J. C. Maxwell, Theory of Heat (Appleton, 1871). 

[13] L. Szilard, Z. Phys. 53, 840 (1929). 

[14] L. Brillouin, J. Appl. Phys. 22, 33 (1951). 

[15] R. Landauer, IBM J. Res. Dev. 5, 193 (1961). 

[16] S. Toyabe, T. Sagawa, M. Ueda, E. Muneyuki, and M. Sano, Nature Phys. 6, 988 
( 2010 ). 

[17] J. V. Koski, V. F. Maisai, T. Sagawa, and J. P. Pekola, Phys. Rev. Lett. 113, 030601 
(2014). 

[18] T. Sagawa and M. Ueda, Phys. Rev. Lett. 100, 080403 (2008). 


134 



[19] T. Sagawa and M. Ueda, Phys. Rev. Lett. 102, 250602 (2009). 

[20] T. Sagawa and M. Lleda, Phys. Rev. Lett. 104, 090602 (2010). 

[21] D. H. E. Gross, arXiv cond-mat, 0508721 (2005). 

[22] C. Jarzynski, arXiv cond-mat, 0509344 (2005). 

[23] T. Sagawa and M. Lleda, Phys. Rev. Lett. 109, 180602 (2012). 

[24] J. M. Horowitz and S. Vaikuntanathan, Phys. Rev. E 82, 061120 (2010). 

[25] P. R. Halmos, Measure Theory (Springer, 1974) pp. 134, 182. 

[26] R. G. Bartle, The Elements of Integration and Lebesgue Measure (John Wiley & Sons 
Ltd., 1995) p. 88. 

[27] Y. Murashita, K. Funo, and M. Lleda, Phys. Rev. E 90, 042110 (2014). 

[28] Y. Ashida, K. Funo, Y. Murashita, and M. Ueda, Phys. Rev. E 90, 052125 (2014). 

[29] J. W. Gibbs, On the Equilibrium of Heterogeneous Substances (Connecticut Acad. 
Sci., 1875-78). 

[30] H. Grad, Comm. Pure and Appl. Math. 14, 323 (1961). 

[31] N. G. van Kampen, The Gibbs Paradox, edited by W. E. Parry (Pergamon, 1984). 

[32] E. T. Jaynes, The Gibbs Paradox, edited by C. R. Smith, G. J. Erickson, and P. O. 

Neudorfer (Kluwer Academic Publishers, 1992). 

[33] W. Pauli, Thermodynamics and the Kinetic Theory of Gases (MIT Press, 1973). 

[34] D. J. Evans and D. J. Searles, Phys. Rev. E 50, 1645 (1994). 

[35] R. Kubo, J. Phys. Soc. Jpn. 12, 570 (1957). 

[36] R. Kubo, M. Yokota, and S. Nakajima, J. Phys. Soc. Jpn. 12, 1203 (1957). 

[37] L. Onsager, Phys. Rev. 37, 405 (1931). 

[38] L. Onsager, Phys. Rev. 38, 2265 (1931). 

[39] G. Gallavotti and E. G. D. Cohen, J. Stat. Phys. 80, 931 (1995). 

[40] G. M. Wang, E. M. Sevick, E. Mittag, D. J. Searles, and D. J. Evans, Phys. Rev. 
Lett. 89, 050601 (2002). 


135 



[41] G. M. Wang, J. C. Reid, D. M. Carberry, D. R. M. Williams, E. M. Sevick, and 
D. J. Evans, Phys. Rev. E 71, 046142 (2005). 

[42] C. Jarzynski, J. Stat. Phys. 96, 415 (1999). 

[43] S. Park and K. Schnlten, J. Chem. Phys. 120, 5946 (2004). 

[44] G. Hummer and A. Szabo, Proc. Natl. Acad. Sci. USA 98, 3658 (2001). 

[45] J. Liphardt, S. Dumont, S. B. Smith, J. Ignacio Tinoco, and C. Bustamante, Science 
296, 1832 (2002). 

[46] N. C. Harris, Y. Song, and C.-H. Kiang, Phys. Rev. Lett. 99, 068101 (2007). 

[47] G. E. Crooks, J. Stat. Phys. 90, 1481 (1998). 

[48] C. Jarzynski, J. Stat. Phys. 98, 77 (2000). 

[49] D. Collin, F. Ritort, C. Jarzynski, S. B. Smith, J. I. Tinoco, and C. Bustamante, 
Nature 437, 231 (2005). 

[50] T. Hatano and S. -i. Sasa, Phys. Rev. Lett. 86, 3463 (2001). 

[51] U. Seifert, Phys. Rev. Lett. 95, 040602 (2005). 

[52] T. Speck and LI. Seifert, J. Phys. A: Math. Gen. 38, L581 (2005). 

[53] U. Seifert, Rep. Prog. Phys. 75, 126001 (2012). 

[54] K. Sekimoto, J. Phys. Soc. Jpn. 66 (1997). 

[55] C. Jarzynski, J. Stat. Mech.: Theor. Exp. , P09005 (2004). 

[56] K. Muruyama, F. Nori, and V. Vedral, Rev. Mod. Phys. 81, 1 (2009). 

[57] T. Sagawa, Thermodynamics of Information Processing in Small Systems (Springer, 

2012 ). 

[58] M. A. Nielsen and I. L. Chuang, Quantum Computation and Quantum Information 
(Cambridge LIniversity Press, 2000). 

[59] T. Sagawa and M. Lleda, Phys. Rev. E 85, 021104 (2012). 

[60] D. Abreu and U. Seifert, Phys. Rev. Lett. 108, 030601 (2012). 

[61] S. Lahiri, S. Rana, and A. M. Jayannavar, J. Phys. A: Math. Theor. 45, 162001 

( 2012 ). 


136 



[62] J. Sung, arXiv cond-mat, 0506214 (2005). 

[63] R. C. Lua and A. Y. Grosberg, J. Phys. Chem. B 109, 6805 (2005). 

[64] S.-i. Sasa, private communication. 

[65] K.-H. Kim and S. W. Kim, Phys. Rev. E 84, 012101 (2011). 

[66] R. Kawai, J. M. R. Parrondo, and C. Van den Broeck, Phys. Rev. Lett. 98, 080602 
(2007). 

[67] J. M. R. Parrondo, C. Van den Broeck, and R. Kawai, New. J. Phys. 11, 073008 
(2009). 

[68] J. W. Gibbs, Elementary Principles in Statistical Mehcnics (Yale LIniversity Press, 
1902). 

[69] R. P. Feynman, Statistical Mechanics: A Set of Lectures (Westview Press, 1972). 

[70] D. Zubarev, Nonequilibrium Statistical Thermodynamics (Plenum Pub. Corp., 1974). 

[71] L. D. Landau and E. M. Lifshitz, Statistical Physics (Butterworth-Heinemann, 1975). 

[72] M. Toda, R. Kubo, and N. Salto, Statistical Physics I: Equilibrium Statistical Me¬ 
chanics (Springer, 1978). 

[73] H. B. Callen, Thermodynamics and an Introduction to Thermostatistics (John Wiley 
& Sons, 1985). 

[74] K. Funo, Y. Murashita, and M. Ueda, arXiv , 1412.5891 (2014). 

[75] G. Verley, M. Esposito, T. Willaert, and C. Van den Broeck, Nature Comm. 5, 4721 
(2014). 


137 



